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1.0  INTRODUCTION 


This  document  summarrzes  work  performed  by  Delta  Information 
Systems,  Inc..  (DIS)  fo'*  the  National  Communications  System  (NC5),  Off.ce  of 
Techno*og_/  and  Standards.  This  office  is  responsible  fcr  the  man.ageme-it  of  the 
.=eceral  Teleccrr.munications  Standards  Program,  which  develops 
telecommunications  standards,  whose  use  is  mandatory  for  aT  Fecera  ag=rt.e£. 
"he  purpose  of  this  study,  pe^fo'med  under  task  order  number  SS-C-CCTi-CC*  c~ 
ccnfact  Cumber  DCAl00-oS-C-007S,  was  to  investigate  the  Frcgressi.e  B.-e.e' 
Coding  Technique  "^or  the  High  Resolution  3i-ie-.'e'  Data  Ccmpress.on  Stancarc. 

■^he  p.''ogressive  bi -level  coding  technique  consists  of  repeated: / 
reducing  the  resolution  of  a  bi-ievel  image,  Rq,  creating  images  R,,  R-,...,  P.^, 
Image  F,  hav*ng  one-half  the  number  of  pixels  per  line  and  cr-e-half  the  number 
of  lines  of  image  .R,.,.  The  lowest-resolution  image,  R„,  called  the  base  layer,  :s 
t.'^ansmitted  ‘oss'essiy  'free  of  distortion)  by  binary  arithmetic  coding.  Ne.xt, 
image  Rj._,  is  t-^ansmitted  iossisssly,  using  pixels  in  R„  and  previously- 
transmitted  fcausa')  p  xeis  in  .R„_,  as  predictc.^'s  in  an  atte-mpt  to  s.'^edict  the 
next  .R^.,  pixel  to  be  transmitted.  If  prediction  is  possible  (both  transmitter  and 
receiver  are  equipped  with  rules  to  tell  whether  this  is  the  case),  the  prsd=cted 
pi.xe!  value  is  not  transmitted.  This  progressive  build-up  s  repeated  unti 
image  Rq  has  been  lossless! y  transmitted. 

1.1  Background 

.At  the  present  time,  CCITT  Recommendations  for  Grouc  4-  cermt  the 
transmission  of  only  sequentially  transmitted  black  and  white  imagery.  In 
addition,  coding  efficiency  of  Group  4  machines  suffers  for  input  pages 
containing  half-tone  information.  As  a  result  of  much  increased  commercial 
interest  from  major  computer  and  telecommunications  comparies,  there  nas  ces/i 
intense  effort  in  the  international  standards  bodies  to  seisct  a  bi-levei  image 
compression  technique  for  future  image  storage  and  communications  applicat.cns. 
The  focal  point  of  this  ac..vity  has  been  the  Joint  3i-leve!  Image  Group  (JBIG)  of 
ISQ/TEC  and  CCITT. 

The  JBIG  was  formed  in  1988  under-tffe  umbi'eila  the  ISO  wo-'king 
group  (now  I30/IEC/JTC1/SC2/V/G8  -  Coded  Representation  of  Picture  and  Audio 
Information).  It  brings  together  ISO  picture  coding  knowledge  with  CCITT 
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telecommunication  service  expertise  ffrcm  the  New  Image  Communications 
group  of  CCITT  Study  Group  Vlli).  Its  aim  was  to  select  and  develop  a 
compression/decompressicn  -echn'que  for  a  general  class  cf  bi-ievei  i-nages.  ' 
techr'que  w  M  form  the  basis  of  both  an  ISO  standard  and  a  CCIT" 
recc-mmendation. 


oc-ten* 
te!  SCO! 


A  scecification  for  a  compressicn  technique  was  formulated  fc'"  a 
a!  range  of  services  and  applications  including  facsim  !e,  audicgi-aDhrc 
fererc  ng  and  image  databases.  To  suppc^t  such  a  range  of  app!  cations 


the  techn.Que  snc-uld  be  adaptable  to  a  wide  range  of  image  resolutions  and  tc- 
/ar/ing  i.'na-ge  qualitv.  It  should  also  be  capable  of  providing  progressive 
cmulti-stage  wit*^  improving  quality)  or  sequential  image  build-up. 

At  a  series  of  JBIG  meetings,  beginning  -n  Stockhoim  ;n  July,  19SS, 
an  algorithm  was  selected  from  the  five  remain  ng  prior  to  the  Stockholm 


T.eetirc.  The  five  v/ere:  P'^ogressive  Transmission  of  Si.nary  Lmages  by 
-‘lerarchical  Coding  (BIhC);  Progressive  Encoding  of  .facsimile  Image  Using  Edge 
Decomposition  ’PEC);  Prog.'-essive  Encoding  of  Predicted  Signal  .According  to 
Classified  Pel  Patterns  (PCLAP);  .Progressive  Adaptive  Bi-level  Image 
Compressicn  (PBIC);  and  Progressive  Coding  Scheme  Using  Block  Reduction 
Cpcss;-. 


The  selected  algorithm  performs  im.age  I'educt'on.  typical  prediction 
i^PJ,  deterninistic  pi'ediction  (DP)  and  binary  arithmetic  encoding  and  decoding. 
The  image  reduction  algorithm,  called  PRES  (Progressive  Reduction  Standard  for 
Ei-leve!  Imiages),  is  a  combination  of  BIHC  and  PCSB  with  better  reduced  image 
-quality  than  either. 

The  Progressive  Bi-level  Coding  Technique  is  a  coding  technique 
showing  the  promise  of  large  compression  ratios  for  the  coding  and  progressiva 
transmission  of  bi-Ievel  data.  No  comprehensive  study  analyzing  this  techn.que, 
as  applied  to  Group  4  facsimile  systems  under  carefully  controlled  conditions, 
has  been  performed  to  date.  The  purpose  of  the  task  reported  herein  was  to 
analyze  *he  Progressive  Bi-level  Coding  Technique  to  determine  its  relati  /e 
sffectivr ness  as  applied  to  Group  4  facsimile. 

This  report  is  composed  of  six  section^  and  two  appendices.  Section 
1.0  pr'^uents  a  brief  synopsis  of  the  stud'*  and  outlines  its  results  and 
conci  sions.  Section  2.0  covers  the  theory  and  description  of  the  JBIG  Base 
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System.  Section  3.0  describes  the  simulation  system  developed  during  th-s 
study.  Section  4.0  presents  the  simulation  objectives  and  briefly  descr.bes  the 
test  images.  Section  5.0  covers  the  simulation  results,  and  Section  6.0  presents 
the  ccnciusions  derived  from  these  results.  Appendix  A  outlines  the  basic 
principles  of  binary  arithmetic  coding,  and  Appendix  3  contains  d.splays  of  the 
test  images. 

1.2  Synopsis 

The  Investigation  was  conducted  in  four  major,  and  to  seme  extent 
overlapping,  phases.  The  first  phase  consisted  of  a  study  of  J3IG 
documentation,  and  continued  throughout  the  project  as  more  was  released. 
SimuSaticn  software,  hereafter  called  the  study  system,  was  developed  m  the 
second  phase  with  the  objectives  of  functionally  duplicating  the  JBIG  Base 
System  and  of  supplementing  data  emerging  from  the  ongoing  JBIG 
investigations.  The  third  phase  was  devoted  to  simulating  the  t.-ansmission  and 
reception  of  JBIG  test  images.  The  fourth  phase  consisted  of  evaluating  the 
results,  writing  the  final  report  and  preparing  all  deliverable  items. 

Two  significant  results  emerged  from  this  study:  (1)  best  data 
compression  usually  was  achieved  by  performing  at  least  one  image  reduction 
rather  than  simply  coding  the  highest-resolution  image;  and  (2)  a!!  the  major 
features  of  the  algorithm  played  important  roles  in  data  compression,  some  Trore 
than  others,  depending  upon  image  type  and  resolution. 
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2.0  INVESTIGATION 

2.1  Theory 

2.1.1  General  Description 

The  Progressive  Bi-Leve!  Ceding  Technsqje  consists  cf  sta'^ing  with  a  bi- 
levei  'mage  of  scoie  resc-iution  and  repeatedly  reducing  it.  each  new  image 
ha/ing  half  as  many  pixels  per  line  and  half  as  many  lines  as  its  parent  image. 
(If  the  lirss  length  ^rd/c-r  number  of  lines  is  edd  in  the  parent  image,  the 
corresponding  number  in  the  reduced  image  is  rounded  up  to  the  nearest 
integer.}  The  Ic-west-resclution  image  is  called  the  base  layer;  all  others  are 
c^ied  difference  layers. 

The  base  layer  is  transmitted  withc-ut  distortion,  pixel  by  pixel,  data 
cempresskon  achieved  by  binary  arithmetic  coding.  The  next  higher  resolution 
layer  is  then  transmitted  by  a  combination  cf  prediction  and  binary  arithmetic 
coding.  Prediction  is  based  on  the  values  of  predictor  pixels  from  the  base 
layer,  at!  of  whose  pixels  are  known  to  both  transmitter  and  receiver,  and  from 
difference  layer  pLxels  already  encoded  or  predicted.  Pixels  which  the 
transmitter  and  receiver  both  "know"  can  be  predicted  correctly  are  not 
encoded.  Successively  higher  resolution  images  are  similarly  transmitted  with 
the  previous  image  used  as  a  reference  until  the  criginal  image  has  been 
transmitted  (or  the  process  stopped  at  the  receiver’s  request). 

A  sequential  mode  of  transmission  also  exists.  It  consists  of  performing 
the  entire  progressive  transmission  on  successi  ve  horizontal  stripes  of  the 
original  image,  in  this  mode  the  receiving  party  never  sees  the  entire  image 
until  all  stripes  have  been  completed. 


2.1.2  Notations  and  Definitions 

The  following  terms  to  be  used  throughc-ut  this  report  are  defined  here. 
Any  one  image  in  the  progressive  hierarchy. 


Layer 
Base  Layer 


The  lowest  resolution  image  in  the  hierarchy. 


Difference  Layer  Image  being  encoded  or  decoded  other  than  the 
base  layer. 

Reference  Layer  The  layer  whose  dimensions  are  half  those  of  the 

difference  layer  currently  being  enccKied  or  decoded.  The 
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Template 


has  no  reference  layer.' 

A  set  of  pixels  in  the  reference  and/or  encoded  (decoded) 
layer  in  the  neighborhood  of  the  pixel  being  processed. 
Templates  are  used  for  prediction  and  for  binary  ar'tnmetic 
coding  "contexts." 

State  A  binary  number  representing  the  values  of  the  pixels  in  a 

template;  also  the  current  point  in  the  probab-lity 
estimation  chain  of  the  binary  arithmetic  coder. 

at  Adaptive  context  templates,  described  below. 

DP  Deterministic  prediction,  described  below. 

TP  Typical  prediction,  described  below. 

2.1.3  Image  Reduction 

In  all  the  image  reduction  algorithms  proposed  in  various  JBIG  documents, 
including  PRES,  each  low-resolution  pixel  is  determined  by  the  values  of  several 
high^resolution  pixels  and  low-resolution  pixels  that  have  already  been 
detfeTmined.  The  objective  of  'the  .'•eduction  algorithm  is  to  preserve  as  much 
detail  as  possible  in  the  low-resolution  image  under  the  constraint  that  the 
latter  be  half  as  wide  and  high  as  the  high-resolution  image.  Different 
algorithms  approach  this  objective  in  different  ways;  the  PRES  approach  is 
described  later. 

2.1.4  Prediction 

When  a  difference  layer  is  being  encoded  or  decoded,  much  of  the 
compression  is  achieved  by  predicting  new  pixel  values  from  the  values  of  pixels 
in  a  predictor  template.  The  predictor  template  contains  pixels  from  the 
reference  layer  and  pixels  already  predicted  or-^jpoded  from  the  difference 

'  In  image  reduction,  the  higher-resolution  layer  is  called  the  reference  layer,  since  the 
lower-resolution  layer  is  the  layer  being  produced. 
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layer.  When  the  predictor  state  is  such  that  the  prediction  is  known  to  be 
correct  (the  receiver  must  know  this  also),  the  predicted  pixel  value  need  not  be 
encoded  or  decoded.  The  JBIG  algorithm  employs  two  kinds  of  prediction; 
typical  prediction  (TP)  and  deterministic  prediction  (DP).  The  meanings  of  TP 
and  DP  are  described  in  general  terms  here,  and  their  implementations  .n  the 
JBIG  aigorithm  are  described  in  a  later  section. 

2.1.4.1  Typical  Prediction  (TP) 

■typical  prediction  refers  to  prediction  in  which  the  predicted  value  is 
almost  always,  but  not  necessarily  always,  correct.  Since,  in  bi-!evel  imagery, 
each  pixel  carries  only  one  bit  of  information,  it  would  be  wasteful  for  the 
transmitter  to  inform,  the  receiver  of  whether  the  prediction  is  correct  for  each 
pixe'  predicted.  Instead,  the  transmitter  looks  ahead  for  and  reports  TP  errors 
(exceptions).  One  reporting  method  is  to  transmit  a  pointer  to  the  next 
exception.  Another  is  to  transmit  an  exception/no-exception  bit  at  known 
intervals,  the  bit  set  if  the  interval  following  it  contains  at  least  one  exception. 

If  so,  TP  is  disabled  throughout  that  entire  interval. 

2.1.4.2  Deterministic  Prediction  (DP) 

Deterministic  prediction  refers  to  prediction  in  which  the  predicted  value 
is  always  correct.  DP  is  tightly  bound  to  the  image  reduction  rules.  Whether  a 
pixel  is  or  is  not  deterministically  predictable  is  determined  by  looking  up  a  rule 
in  a  table  indexed  by  the  state  of  the  predictor  pixels.  The  rule  is  one  of  the 
following;  Predict  black.  Predict  white  or  Don’t,  predict. 
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2.1.5  Binary  Arithmetic  Coding 

2.1. 5.1  Binary  Arithmetic  Coders 

Appendix  A  presents  the  basic  principles  of  binary  arithmetic  coders.  In 
a  later  section  the  JBIG  coder  is  briefly  described. 

2. 1.5.2  Contexts 

In  the  description  of  binary  arithmetic  coding  presented  in  Appendix  A,  it 
is  assumed  that  the  context  (in  the  ordinary  English  sense)  in  which  the  s>mbol 
occurs  is  ignored. 

The  data  compression  achieved  by  a  binary  arithmetic  coder  is  best  when 
the  probabilities  of  the  two  symbols  are  near  1  and  0,  and  worst  when  they  are 
near  1/2.  In  any  practical  application,  the  probability  of  a  1  or  0  at  any  given 
time  is  f^'equently  dependent  upon  the  conditions  under  which  the  symbol  is 
being  encoded  or  decoded.  Therefore,  best  compression  is  achieved  by  keeping 
separate  probability  estimates  for  those  conditions  under  which  the  enc  'd 
Symbol  probabilities  are  the  most  strongly  skewed.  These  conditions  are  ..ailed 
contexts. 

Consider,  for  example,  a  bi-level  image  containing  line  drawings  and  text. 
As  this  image  is  scanned,  if  the  previous  pixel  was  white,  then  there  is  a  high 
probability  that  the  current  one  will  be  white  also.  Therefore,  if  one  uses  the 
previous  pixel  value  as  a  predictor,  there  are  two  contexts,  one  for  each  color  of 
the  previous  pixel.  The  probabilities  for  each  are  usually  much  nearer  1  and  0 
than  is  the  single  probability  with  the  previous  pixel  value  ignored. 

In  the  JBIG  system,  there  is  a  separate  context  for  every  possible 
combination  of  pixel  values  in  a  context  "template."  These  templates  are 
described  later.  Because  there  can  be  thousands  of  contexts,  an  important 
consideration  in  system  design  is  to  minimize  the  memory  requirements  for  each. 
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2.1. 5,3  Adaptive  Context  Templates 

The  purpose  of  adaptive  context  templates  is  to  take  advantage  of 
horizontal  periodicity,  which  often  occurs  in  half-tone  images. 

Data  compression  is  best  if  at  least  one  of  the  pixels  in  a  context  template 
is  a  good  predictor  of  the  pixel  being  encoded.  An  adaptive  context  template, 
bearing  the  acronym  AT  (adaptive  template,  formerly  AC  for  adaptive  context), 
contains  a  "floating"  pixel;  all  other  pixels  in  the  template  are  fixed  in  position 
relative  to  the  encoded  pixel.  There  are  also  other  pixels  designated  as 
candidate  floating  pixels,  not  currently  a  part  of  the  context  template. 

If  one  of  the  candidates  becomes  a  much  better  predictor  than  the  current 
floating  pixel,  then  the  candidate  and  current  floating  pixels  swap  status,  so 
that  the  better  predictor  becomes  a  part  of  the  context  template.  This  test  is 
made  infrequently,  and  the  swap  is  made  only  if  the  candidate  is  a  much  better 
predictor.  These  restrictions  are  imposed  because,  when  a  swap  is  made, 
compression  is  temporarily  degraded  until  the  binary  arithmetic  coder  has  time 
to  adapt  to  the  swap. 

2.2  Transmission  Modes 

The  JBIG  specification  calls  for  two  transmission  modes;  progressive  and 
sequential. 

2.2.1  Progressive  Transmission 

In  progressive  transmission  the  entire  starting  image  is  reduced  to  half 
its  height  and  width.  The  reduced  image  is  similarly  reduced,  this  process 
repeated  some  specified  number  of  times.  The  image  layer  produced  by  the  last 
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reduction  is  called  the  base  iayer. 

The  base  layer  is  encoded.  The  transmitter  then  performs  prediction 
where  possible,  encoding  those  pixels  which  cannot  be  predicted,  to  transmit  the 
next  higher  resolution  layer,  using  the  base  iayer  as  a  reference.  This  higher- 
resolution  layer  is  then  used  as  a  reference  to  predict  and  encode  a  still 
higher-resolution  layer.  This  progression  is  continued  until  the  original  image 
has  been  transmitted. 

2.2.2  Sequential  Transmission 

Sequential  transmission  consists  of  dividing  the  original  image  into 
horizontal  stripes  and  then  transmitting  each  stripe  in  the  progressive  mode. 

The  whole  series  of  progressive  transmissions  described  above  is  performed  on 
each  stripe  before  it  is  begun  on  the  next. 

2.3  The  JBIG  Base  System 
2.3.1  Summary 

The  base  system  consists  of  two  major  parts:  image  reduction  and 
transmission.  The  base  specification  recommends  the  PRES  image  reduction 
algorithm  (Progressive  Reduction  Scheme  for  Bi-level  Images),  but  does  not 
require  it.  The  following  description  includes  PRES,  and  parts  of  the  rest  of  the 
system,  especially  deterministic  prediction  (DP),  depend  upon  the  employment  of 
PRES. 

In  the  progressive  transmission  mode,  the  encoder  reduces  the  whole 
image  a  specified  number  of  times,  e.g.  5.  The  highest  image  layer,  RO,^  is 


2 

The  notation  used  here  is  not  necessarily  a  part  of  the  JBIG  Base  System  Specification. 
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reduced  to  form  layer  R1,  R1  to  form  R2 . R4  to  form  R5.  R5  is  called  the  base 

layer;  RO  through  R4  are  called  difference  layers.  One  of  the  study  objectives 
was  to  measure  the  total  compression  as  a  function  of  how  many  reductions  are 
performed,  including  none. 

Transmission  is  begun  by  the  encoder’s  encoding,  and  the  decoder’s 
decoding,  the  base  layer,  using  a  binary  arithmetic  coder  with  contexts 
consisting  of  causal  pixels  (pixels  known  to  both  encoder  and  decoder)  in  the 
neighborhood  of  the  target  pixel.  AT  (Adaptive  Template)  is  employed:  the 
context  template  has  a  floating  pixel  that  can  change  if  a  candidate  floating  pixel 
is  a  much  better  predictor.  Since  the  base  layer  has  no  reference  layer,  no 
prediction  of  any  kind  is  attempted  except  in  the  sense  that  the  contexts  act  as 
predictors  to  improve  data  compression. 

Next,  layer  R4  is  transmitted.  Because  a  reference  layer  (R5)  is  now 
available,  prediction  is  employed.  Both  TP  (typical  prediction)  and  DP 
(deterministic  prediction)  are  employed,  with  TP  tried  first.  If  a  pixel  in  layer 
R4  is  TP,  it  is  not  encoded.  If  not,  the  pixel  is  tested  for  beirig  DP,  and  if  so,  is 
not  encoded.  If  the  pixel  is  neither  TP  nor  DP,  it  is  encoded  with  adaptive 
context  templates  which  include  pixels  from  both  layers.  The  floating  pixel  is 
always  in  the  layer  being  encoded. 

Similarly,  layers  R3,  R2,  R1  and  finally  RO  are  encoded  and  decoded,  with 
the  next  lower-resolution  layer  (next  higher  layer  number)  serving  as  the 
reference  layer. 

JBIG  investigators  considered  two  types  of  scan:  row  scan  and  z  scan,^^^ 
and  they  selected  row  scan.^^^  In  row  scan,  each  image  line  is  transmitted  in  its 
entirety  before  the  next  is  begun.  A  prediction  or  context  template  may  contain 
pixels  anywhere  above  the  current  line  or  to  the  left  of  the  target  pixel  in  that 


2-7 


line.  In  z-scan,  two  lines  are  scanned  simultaneously.  At  each  point  in  the 
scan,  a  square  block  of  four  pixels,  two  in  each  of  the  two  lines,  is  transmitted, 
and  the  scan  is  advanced  to  the  next  such  block.  Any  pixel  above  the  pair  of 
lines  or  to  the  left  of  the  block  in  either  line  of  the  pair  can  belong  to 
templates. 

In  sequential  transmission,  the  progressive  mode  is  applied  to  successive 
horizontal  stripes  of  the  original  image.  Each  stripe  is  reduced  the  prescribed 
number  of  times,  and  the  base  layer  and  then  successively  higher-resolution 
layers  are  transmitted.  The  whole  process  is  performed  on  each  stripe,  which  is 
treated  as  a  smail  image  being  transmitted  in  the  progressive  mode.  Some 
proposals  suggest  using  the  first  line  of  the  next  stripe  as  the  assumed  line 
below  the  bottom  of  the  current  stripe  instead  of  a  replication  of  the  last  line  of 
the  current  stripe.  These  boundary  conditions  are  described  below  for  the 
progressive  case.  There  is  also  the  possibility  of  saving  the  binary  arithmetic 
coder  contexts  after  encoding  (decoding)  each  layer  of  a  stripe,  so  they  can  be 
used  again  for  the  next  stripe.  This  might  improve  compression  because  the 
binary  arithmetic  coder  would  not  have  to  readapt  to  the  image  statistics  after 
initially  assuming  equal  black  and  white  probabilities  in  all  contexts. 

2.3.2  Image  Pixel  Notation 

The  following  description  applies  to  both  image  reduction  and 
transmission.  Figure  2.1  shows  the  relationships  among,  and  notations  for, 
pixels  in  an  image  layer  having  a  given  resolution  and  in  the  layer  having  half 
this  resolution.  In  what  follows,  the  former  layer  will  be  called  the  high- 
resolution  layer  and  the  latter  the  low-resolution  layer.  High-resolution  pixels 
e,  f,  h  and  I  register  with  low-resolution  pixel  P,  as  shown  by  the  heavy 
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rectangles. 

In  image  reduction,  pixel  values  in  the  high-resolution  layer,  and  those  in 
the  low -resolution  layer  already  determined,  are  used  to  determine  the  next  low- 
resolution  layer  pixel.  In  image  encoding  and  decoding,  pixel  values  in  the  low- 
resolution  layer  and  those  already  encoded  (decoded)  in  the  high-resolution 
layer  are  used  to  try  to  predict  the  next  high-resolution  pixel. 

In  JBIG  literature  and  in  this  report,  the  term  "phase"  is  sometimes  used 
to  refer  to  one  of  the  four  high-resolution  pixels  denoted  by  e,  f,  h  and  i. 

Phase  0  refers  to  e,  1  to  f,  2  to  h  and  3  to  i. 

2.3.3  PRES  Image  Reduction 

The  simplest  method  of  reducing  an  image  to  half  its  size  in  both 
dimensions  is  straight  sub-sampling:  keep  every  other  pixel  in  a  given  high- 
resolution  line,  and  do  this  to  every  other  line.  In  bi-level  images,  however,  this 
method  quickly  washes  out  detail,  images  containing  text  or  line  drawings, 
lines  forming  the  drawings  or  text  v^haracters  grow  thinner  with  each  reduction, 
and  soon  disappear.  In  half-tone  images,  gray  levels  become  badly  distorted. 

JBIG  participants  selected  PRES  from  a  number  of  image  reduction 
algorithms,  for  example:  Progressive  Coding  Scheme  Using  Block  Reduction 
(PCSB)^®^;  rogressive  Edge  Decomposition  of  Facsimile  Images  (PED)^®^;  and  a 
projection  method  that  combines  filtering,  sub-sampling  and  line  preservation 

The  PRES  algorithm  consists  of  two  parts:  (1)  a  formula  for  determining  a 
low-resolution  pixel  value,  and  (2)  a  list  of  exceptions  that  override  the  formula. 
The  formula  is,  in  effect,  a  filter,  and  the  exceptions  seek  to  preserve  such 
features  as  lines,  edges  and  dither  patterns. 

Figure  2.2  shows  the  windows  that  are  scanned  over  the  high-  and  low- 
resolution  images  as  reduction  is  performed.  (The  low-resolution  image  can  be 


2-10 


High-resolution  Window 
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P 

Low-resolution  Window 


Figure  2.2  High  and  Low  Resolution  Windows 
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High-res.  Window 


High-resolution  Image 


Low-res.  Window 


Low-resolution  Image 


Note:  The  high-resolution  image  dimensions  are 

assumed  to  be  even.  See  text  for  description  of 
treatment  when  either  dimension  is  odd. 

Figure  2.3  Window  Alignment  at  Image  Corners 
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thought  of  as  containing  "blank"  pixels,  with  PRES  filling  in  the  blanks.)  The 
heavy  rectangles  in  the  two  windows  show  how  the  four  high-resolution  pixels  e, 
f,  h  and  i  overlay  pixel  P,  the  low-resolution  pixel  to  be  determined  at  a 
particular  place  in  the  scan.  Low-resolution  pixels  A,  B  and  C  have  already 
been  determined. 

Figure  2.3  shows  how  the  high-  and  low-resolution  windows  align  with  the 
image  corners,  where  the  high-resolution  image  is  assumed  to  have  an  even 
number  of  pixels  per  line  and  an  even  number  of  lines.  The  large,  heavy 
rectangles  represent  the  actual  image  boundaries.  Imaginary  white  pixels  are 
assumed  above  and  to  the  left  of  both  images.  In  the  high-resolution  image, 
if  the  number  of  pixels  per  line  is  odd,  each  image  line  is  assumed  to  have  a 
white  pixel  appended  to  its  right  end,  therefore  '^.aking  the  effective  image 
width  even.  Similarly,  if  the  number  of  lines  is  odd,  an  extra  line  of  white  pixels 
is  assumed  to  be  appended. 

The  width  and  height  of  the  low-resolution  image  is  given  by: 

W  =  (w  +  1)/2 

and 

H  =  (h  +  1)/2, 

where  W  and  H  are  the  low-resolution  image  width  and  height,  w  and  h  are  the 
actual  high-resolution  width  and  height,  and  the  divisions  are  integer  divisions, 
the  results  equivalent  to  rounding  the  real  divisions  w/2  and  h/2  to  the  nearest 
integer.  Thus,  for  odd  w  (h),  W  (H)  is  half  the  width  (height)  of  the  high- 
resolution  image  with  the  white  pixel  (line  of  pixels)  appended. 

High-resolution  pixels  a,  b,  c,  d  and  g,  and  low-resolution  pixels  A,  B  and 
C,  always  have  known  values,  because  imaginary  white  pixels  are  assumed  above 
and  to  the  left  of  both  images.  Thus,  the  PRES  algorithm  need  not  make 
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exceptions  at  the  image  boundaries;  assuming  that  the  images  are  framed  by 
white  pixelstakes  care  of  the  boundary  conditions  and  of  the  case  of  odd  high- 
resolution  image  width  or  height. 

After  the  first  iow-resolution  pixel  has  been  generated,  the  high- 
resoiution  window  is  moved  two  pixels  to  the  right,  and  the  low -resolution 
window  one  pixel  to  the  right.  The  first  line  of  each  image  is  thus  scanned  unt  I 
the  end-of-line  condition  is  reached  as  shown  in  Figure  2.3.  Next,  the  high- 
resolution  window  is  moved  to  its  starting  position,  but  down  two  lines. 

Similarly,  the  low-resolution  window  is  moved  tc  its  starting  position,  but  down 
one  line.  This  scanning  is  continued  until  all  low-resolution  pixels  have  been 
generated. 

The  PRES  reduction  formula  is,  in  the  absence  of  an  exception: 

Set  SUM  =  4e  +  2(b  +  d  +  f  +  h)  +  a  +  c  +  g  +  i  -  3(B  +  C)  -  A. 

If  SUM  is  greater  than  or  equal  to  5,  set  P  =  1,  otherwise  0. 

This  formula  is,  in  effect,  a  filter,  with  previously-determined  low-resolution 
pixels  included. 

The  exceptions  reside  in  a  table  containing  exception  states  and  e.xception 
pixel  values.  An  exception  state  is  formed  by  concatenating  pixel  values  A,  B,  C, 
a,  b,  c,  d,  e,  f,  g,  h,  and  i  in  left  to  right  order  and  right-justifying  these  bits 
in  an  integer  word,  where  these  pixel  values  represent  some  feature  that  must 
be  preserved.  The  existence  of  an  exception  is  tested  by  similarly  assembling 
the  actual  pixel  values  into  a  state  and  searching  the  exception  table  for  a 
matching  state.  If  a  match  is  found,  the  required  value  of  P  is  taken  from  the 
exception  table;  otherwise  it  is  computed  from  the  formula. 

In  an  actual  PRES  reduction  implementation,  the  whole  algorithm,  including 
the  formula  and  the  exceptions,  is  contained  in  a  PRES  look-up  table  to  save 
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processing  time.  For  each  low-rssoiution  pixel,  P,  to  be  generated,  a  state  is 
assembled  as  described  above.  This  state  is  used  as  an  index  to  the  table,  from 
which  the  .'•equired  value  of  P  is  retrieved.  The  PRES  table  is  originally 
generated  by  executing  the  actual  PRES  algorithm  once  for  each  of  the  40S5 
possible  state  values. 

2.3.4  Generic  Typical  Prediction  (TP-G) 

.JBIG  participants  selected  generic  typical  prediction  from  among  other 
schemes^®^  that  included  rules  tied  to  the  image  reduction  rules  in  addition  to 
the  generic  rules  to  be  described  here.  TP-G  refers  to  TP  based  only  on  the 
generic  rules.  For  the  remainder  of  this  report,  the  notation  TP  implies  TP-G. 

In  the  following  description,  pixel  notation  and  the  .'"egistration  of  the 
high-  and  low-resc’ution  images  are  as  shown  In  Figure  2.1. 

In  generic  typical  prediction,  a  TP  cluster  is  defined  as  a  low-resc-lution 
pixel  surrounded  by  a  neighborhood  of  pixels  of  the  same  color.  The 
neighborhood  consists  of  pixels  horizontally,  ertically  and  diagonally  adjacent 
to  the  pixel  in  question.  (The  boundary  conditions  for  TP-G  and  fo'  all 
encoding  and  decoding  operations  are  the  same  as  for  image  reduction,  except 
that  the  imaginary  pixels  below  an  actual  image  are  assumed  to  be  copies  of  the 
pixels  in  the  last  image  line,  instead  of  white.) 

If  low-resolution  pixel  P  belongs  to  such  a  cluster,  then  high-resolution 
pixels  e,  f,  h  and  i  almost  always  have  the  same  color  as  P.  This  is  particularly 
true  in  images  having  large  areas  of  solid  color,  as  in  line  drawings  and  text. 

Prior  to  encoding  a  pair  of  high-resolution  image  lines,  the  encoder 
performs  a  TP  test  consisting  of  examining  the  corresponding  low -resolution 
iine.  (No  prediction  of  any  kind  is  employed  for  the  base  layer,  where  there  is 
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no  lower-resolution  layer.)  For  each  low-resolution  pixel,  P,  belonging  to  a  TP 
cluster,  high-resolution  pixels  e,  f,  h  and  i  are  examined.  If  they  all  have  the 
same  color  as  P,  then  the  four  high-resolution  pixels  are  predictable.  If  not, 
then  a  TP  exception  has  been  encountered. 

In  the  JBIG  Base  System,  if  a  TP  exception  is  found  anywhere  in  the  low- 
resolution  line,  then  that  whole  line  is  declared  as  an  exception,  and  no  TP  is 
attempted  in  the  corresponding  pair  of  high-resolution  lines.  After  the  TP 
exception  check  has  been  completed,  an  exception  bit,  e.g.  i  for  exception,  0  for 
no  exception,  is  encoded  in  its  own  context  to  tell  the  decoder  whether  to  employ 
TP  while  aecoding  the  pair  of  high-resolution  lines. 

Other  schemes  have  been  proposed  for  reporting  TP  exceptions.  For 
example,  one  could  encode  an  exception  bit  more  than  once  per  line,  or  one  could 
encode  a  pointer  to  the  next  exception.  The  selected  method  gives  good  results, 
'and  is  computationally  simple.  This  method  is  described  in,  for  example,  JBIG 
Document  N131.^®^ 

If  no  exception  is  found,  then,  whenever  a  low-resolution  pixel,  P, 
belonging  to  a  TP  cluster  is  encountered,  the  corresponding  pixels,  e,  f,  h  and  i, 
are  not  encoded.  The  decoder,  having  been  told  that  there  is  no  TP  exception, 
merely  inserts  pixels  having  the  same  color  as  that  of  pixel  P. 

Since  the  same  low-resolution  line  applies  to  two  high-resolution  lines,  the 
study  system  saves  execution  time  by  having  the  TP  test  routine  generate  a 
string  of  TP  flag  bits,  one  per  low-resolution  pixel. ^  A  bit  value  of  1  means  that 
the  low-resolution  pixel  belongs  to  a  TP  cluster.  Then,  as  each  high-resolution 
line  of  the  pair  is  scanned,  two  pixels  at  a  time,  the  flag  bits  are  also  scanned. 


This  method  was  suggested  by  C  Chamzas  of  AT&T  in  a  private  communication. 
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one  bit  at  a  time.  If  the  flag  bi*  s  1,  then  the  two  high-resolution  pixels  are 
not  encoded.'* 

2.3.5  Deterministic  Prediction  (DP) 

In  the  JBIG  Base  System,  DP  is  tied  to  the  PRES  reduction  rules. The 
encoder  and  decoder  determine  whether  a  target  difference  layer  pixel  is  DP  by 
a  table  search.  The  table  is  created  from  a  set  of  DP  rules  specified  in  JBIG 
documents  N141  and  N199.^’*^  A  DP  state  is  constructed  from  a  DP  template 
consisting  of  pixels  from  the  lower-resolution  reference  layer  as  well  as  causal 
pixels  from  the  higher-resolution  difference  layer  being  encoded  (decoded).  The 
table  is  then  searched  for  a  matching  state  for  the  phase  in  question  (0,  1,2  or 
3,  i.e.,  pixels  e,  f,  h  and  i,  as  in  Figure  2.1).  If  a  match  is  found,  then  the 
target  pixel  value  is  taken  from  the  table  instead  of  being  encoded  or  decoded. 

The  DP  states  are: 

Phase  State  (Predictor  Pixels)  Target  Pi.''el 


0  A,B,C,P,a,b,c,d  e 

1  A,B,C,P,a,b,c,d,e  f 

2  A,B,C,P,a,b,c,d,e,f,g  h 

3  A,B,C,P,a,b,c,d,e,f,g,h  i 


As  in  PRES,  computation  time  is  saved  by  using  one  look-up  table  for  all 
the  states  of  all  the  phases.  For  a  given  phase,  the  DP  state  is  assembled  from 
the  appropriate  pixels  and  a  table  look-up  is  performed.  There  is  always  a  table 


The  base  system  employs  row  scan;  each  high-resolution  Tine  is  scanned  completely  before 
the  next  is  begun.  An  alternative  is  z-scan,  in  which  two  Tines  are  scanned 
siffluTtaneousTy ,  and  pixels  e,  f,  h  and  i  are  processed  together. 
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entry,  whether  or  not  the  target  pixel  is  DP.  If  it  is,  the  table  delivers  the 
predicted  value,  1  or  0;  otherwise  it  delivers  a  "don’t  predict"  code,  e.g.  2  or  3. 

The  encoder  simulator  in  the  study  system  verifies  correct  prediction. 

Any  prediction  error  would  imply  a  bad  PRES  or  DP  table  or  a  program  failure. 

2.3.6  Adaptive  Context  Templates  (AT) 

2.3.6.1  Template  Descriptions 

Figure  2.4  (two  pages)  shows  the  context  templates  for  encoding 
(decoding)  base  layer  pixels  and  difference  layer  pixels  e,  f,  h  anJ  i  (phases  0, 

1,  2  and  3).  These  templates  are  documented  in  JBIG  Document  N174.^’^^ 

Small  squares  represent  high-resolution  pixels  for  the  difference  layers, 
and  all  base  layer  pixels.  Large  squares  represent  low-resolution  (reference) 
layer  pixels  for  difference  layer  processing.  All  pixel  positions  are  shown 
relative  to  the  pixel  being  encoded  (decoded),  which  is  labeled  with  a  question 
mark. 

Squares  not  containing  characters  represent  pixels  that  are  always  in  the 
template.  In  each  diagram  there  is  one  square  labeled  with  an  asterisk  (*). 

This  represents  the  default  AT  floating  pixel;  it  is  initially  in  the  template,  but 
will  not  be  following  an  AT  swap  unless  it  is  swapped  back  in  later.  Squares 
labeled  3,  4,  5,  6,  7  and  8  denote  candidate  floating  pixels,  not  initially  in  the 
template.  These  are  called  "lag”  pixels  because  they  lag  behind  the  encoded 
(decoded)  pixel  by  a  number  of  pixels  equal  to  their  label  values. 

Whether  or  not  the  default  or  one  of  the  lag  pixels  is  the  floating  pixel, 
the  base  layer  has  7  pixels  in  its  template.  Thus,  there  are  128  contexts  for  the 
base  layer,  i.e.  128  possible  combinations  of  black  and  white  pixel  values  in  the 
template.  For  each  difference  layer  template,  there  are  10  pixels,  some  high- 


2-18 


□ 

□  □  □  □ 

HSESlIIBnnE 

Base  Layer 


□  □  □  □ 


Difference  Layer,  Phase  0 


□ 

□ 

0 


Legend 

Pixel  to  be  coded  (decoded) 

Pixel  always  in  template 

Default  floating  pixel 

Candidate  floating  pixels 
(n  =  3,  4,  5,  6,  7  or  8) 


Figure  2.4  Adaptive  Context  Templates  (Part  1  of  2) 
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Figure  2.4  (Continued)  Adaptive  Context  Templates  (Part  2  of  2) 
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and  some  low-resolution.  Thus,  there  are  1024  contexts  for  each  phase,  or  a 
total  of  4096  contexts  for  difference-layer  processing.  (The  extra  context  for  TP 
exceptions  is  not  included  in  this  total.)  Note  that  it  is  perfectiy  legitimate  for 
iow-resolution  pixels  to  lie  below  or  to  the  right  of  the  encoded  (decoded)  pixei; 
ail  low-resolution  pixels  are  known  to  both  encoder  and  decoder. 

2.3.6.2  AT  Algorithm 

The  AT  aigorithm  (Adaptive  Template,  formerly  called  AC  for  Adaptive 
Context)  is  described,  for  example,  in  JBIG  Documents  N139-R2^^^^  and  N208- 

The  default  floating  pixel  and  the  six  iag  pixeis,  shown  in  each  diagram  of 
Figure  2.4,  are  coilectively  called  AT  (formerly  AC)  pixels;  at  any  given  time,  one 
of  them  is  the  floating  pixel.  The  difference  layer  templates  are  so  designed 
that  the  seven  AT  pixels  have  the  same  spatial  relationships  to  the  target 
(encoded  or  decoded)  pixel,  independent  of  phase.  Therefore,  floating  pixei 
swaps  can  be  tested  and  performed  for  all  phases  at  once.  (Document  JBIG  N139 
does  not  assume  phase  independence;  N208  does.) 

Floating  pixel  changes  are  controlled  in  the  following  manner.  To  each  AT 
pixel  is  allocated  a  counter  whose  count  is  initially  zero.  A  "total”  counter,  C^,^, 
is  maintained  and  also  initialized  to  0.  For  each  target  pixel,  a  decision  is  made 
whether  to  update  the  counters.  Criteria  for  this  decision  are  given  iater. 
Updating  the  counters  consists  of  incrementing  the  totai  count,  C,,p  and  each 
counter  associated  with  an  AT  pixel  having  the  same  color  as  the  target  pixel. 
These  updates  are  made,  of  course,  after  the  target  pixel  has  been  transmitted 
(received);  the  decoder  performs  the  same  updates. 

At  appropriate  times  (criteria  also  given  later),  decisions  are  made 
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whether  to  swap  the  current  floating  pixel  with  another  AT  pixel.  The  decision 
criteria  are  designed  to  make  AT  switching  infrequent,  because,  after  the 
switch,  the  probability  estimates  become  poor,  with  a  consequent  degradation  of 
compression,  until  the  binary  arithmetic  coder  probability  estimator  has  time  to 
adapt  to  the  change. 

AT  counts  are  updated  only  if  all  of  the  following  conditions  hold:  (1)  ~he 
current  coding  (decoding)  line  is  not  the  first  or  second  (difference  layer),  or 
not  the  first  (base  layer);  (2)  All  lag  pixels  are  inside  the  actual  image;  and  (3) 
The  current  pixel  is  not  typically  predictable.  TP  pixels  are  avoided  because 
they  tend  to  occur  in  regions  of  solid  color,  where  the  use  of  AT  is  not 
intended.  (DP  pixels  are  included,  even  though  they  are  not  encoded.) 

The  swap/no-swap  decision  criteria  are  taken  directly  from  JBIG  N139-R2. 
Let  be  the  maximum  value  of  the  counts  in  the  seven  AT  counters,  and 
be  the  minimum  count.  Let  and  be  the  maximum  and  minimum  counts 
excluding  the  count  for  the  default  floating  pixel.  Let  be  the  count  for 

the  current  floating  pixel,  default  or  not.  A  floating  pixel  swap  is  made  only  if 
ail  of  the  following  conditions  are  met: 

1.  >  (7/8)  (The  study  system  tests  for  ^ 

^max  ~  Afloat  ^  ^all  ~  ^fflax> 

3-  C^ax  "  ‘Afloat  ^ 

^max  “  (*-'an  ”  ^float^  ^  ^'all  "  ^inax> 

^max  "  ^^all  ^'float^  ^ 

*^max  “  ^  ^all/^> 

7.  Current  floating  pixel  is  not  the  default  or  -  E^^,,  >  Can/8. 

Condition  1  requires  that  the  count  maximum  be  very  strong.  AT 
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switching  is  desired  in  dithered  images  and  nowhere  e.se.  Condition  2  requires 
that  the  maximum  count  is  moving  more  than  half  way  rrom  the  current  floating 
pixel  count  to  the  maximum  possible  count.  Condition  3  makes  sure  that  the 
maximum  count  is  considerably  greater  than  that  of  the  current  floating  pixel. 
Conditions  4  and  5  are  similar  to  2  and  3,  but  prohibit  switching  when  the 
current  floating  pixel  has  strong  negative  correlation  with  the  target  pixel. 
Condition  6  requires  that  not  all  counts  are  high;  AT  swaps  are  not  wanted  in 
regions  of  solid  color.^  Finally,  condition  7  makes  it  hard  to  swap  the  initial 
default  floating  pixel. 

The  test  for  swapping  is  made  at  the  end  of  every  second  line  for 
difference  layers,  at  the  end  of  each  line  for  the  base  layer,  provided  that  the 
total  count,  C^,,,  is  at  least  2048.  If  the  test  is  made,  all  counters  are  reset  to 
zero,  whether  or  not  a  swap  occurs. 

2.3.7  Binary  Arithmetic  Coder 
2.3.7. 1  Coder  Selection  Criteria 

The  question  of  which  binary  arithmetic  coder  (BAC)  would  be 
recommended  was  an  outstanding  JBIG  issue  until  the  late  winter  of  1989-1990. 
The  selected  coder  was  to  have  the  ge..eric  name  GABAC  (General  Adaptive 
Binary  Arithmetic  Coder)  and  the  specific  name  QM  Coder,  where  QM  stands  for 
Quick-Minimax,  Quick-Melcode  or  Q-Modified.^’®^  This  coder  was  to  combine  the 
best  features  of  the  three  strongest  contenders:  Q-coder  (IBM),  Mel-Coder 
(Mitsubishi)  and  Minimax  coder  (AT&T). 


In  difference  layers,  such  regions  tend  to  be  comprised  of  TP  pixels,  which  are  not 
included  in  the  AT  counts.  In  the  base  layer,  however,  there  is  no  prediction. 
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JBIG  document  N158-R0^’®^  gives  an  excellent  review  of  the  selection 
criteria,  which  are  summarized  below.  Appendix  A  of  this  report  is  a  description 
of  the  principles  of  a  BAG. 

MPS/LPS 

It  was  agreed  by  all  parties  that  classifying  a  binary  symbol  as  MPS  (more 
probable  symbol)  or  LPS  (less  probable  symbol)  is  superior  to  high/low,  1/0, 
black/white,  etc.. 

Computing  MPS/LPS  Sub-interval 

The  better  the  MPS  and  LPS  probability  estimates,  the  better  the 
compression,  other  factors  being  equal.  In  Appendix  A  the  BAG  description 
specifies  multiplication  to  compute  the  LPS  sub-interval.  Full  multiplication  is 
costly,  whether  implemented  in  hardware  or  software.  Approximations  to  the 
multiplications  vary  among  the  different  coders;  all  lead  to  some  compression 
degradation  with  respect  to  full  multiplication,  typically  m  the  order  of  tenths 
of  a  percent  to  a  percent. 

Carry  Management 

Means  must  be  provided  to  prevent  the  propagation  of  carry  bits  into 
data  that  have  already  been  transmitted.  One  of  the  methods,  bit  stuffing,  has 
the  beneficial  effect  of  creating  a  range  of  values  for  the  byte  following  a  byte 
of  all  I’s  that  can  never  occur  in  the  code  stream.  An  all-1 ’s  byte  followed  by 
an  "illegal"  byte  can  therefore  be  used  as  a  marker  code,  as  in  the  JPEG  (Joint 
Photographic  Experts  Group)  environment. 

Byte  vs.  Bit  Output 

As  of  October  1989,  the  0-coder  was  the  only  contender  whose  output  was 
byte  oriented.  This  feature  simplifies  the  output  path  and  provides  a  means  of 
transmitting  marker  codes. 
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Skew 


The  greater  the  MPS/LPS  probability  skew,  the  greater  the  compression. 
The  greater  the  precision  of  the  probability  estimator,  the  greater  the  skew  it 
can  support.  (The  probability  estimator  must  never  estimate  an  LPS  probability 
of  zero,  else  the  LPS  sub-interval  would  vanish,  and  the  code  stream  could  not 
be  decoded.) 

Probability  Estimation 

The  0-coder  probability  estimator  is  much  simpler  than  those  of  the 
others,  because  the  probability  estimate  is  updated  without  counting  symbols 
when  the  A  interval  is  renormalized.  The  0-coder  estimator  behaves  like  a 
"finite  state  machine"  represented  by  a  table  described  later.  The  other  two 
coders  count  symbols,  with  varying  means  of  compromising  between  memory 
allocation  and  probability  estimation  accuracy. 

Fast  Learning  ("Fast  Attack") 

The  reference  covers  fast  learning  under  probability  estimation.  It  is 
brought  out  separately  in  this  report  because  of  its  great  importance  in  the 
JBIG  environment,  where  there  are  thousands  of  contexts  in  difference  layer 
processing.  Compression  in  any  given  context  is  good  when  both  of  the 
following  conditions  are  met:  (1)  at  least  one  of  the  context  template  pixels  is  a 
good  predictor,  and  (2)  the  probability  estimate  for  each  context  is  accurate.  In 
the  JBIG  Base  System,  the  probability  estimates  of  all  contexts  are  initialized  to 
0.5;  hence  compression  does  not  begin  to  get  good  until  the  estimates  have  had 
time  to  approach  the  actual  probabilities.  "Fast  attack"  refers  to  the  ability  to 
estimate  probabilities  approaching  the  true  values  after  only  a  few  "hits"  on  any 
given  context. 


2-25 


2.3.7.2  The  Selected  Coder 


The  coder  finally  selected  is  an  enhanced  version  of  the  Q-coder  whose 
probability  estimator  table  supports  fast  attack.  This  enhancement  is  described 
in  JBIG  Document  N213.^’^^  The  document  authors  represent  Mitsubishi,  IBM  and 
AT&T,  originators  of  the  three  contenders.  The  history  and  properties  of  this 
coder  are  summarized  below. 

In  the  original  0-coder,  invented  by  IBM,^^®^  the  LPS  probability  estimates 
had  12  bits  of  precision.  The  coder  approximated  the  products  A*Q  and  A*P  by 
0  and  A-0  respectively,  where  A  is  the  probability  interval,  and  P  and  Q  are  the 
estimated  MPS  and  LPS  probabilities.  See  Appendix  A  for  BAG  principles  and 
notation. 

For  the  multiplication  approximations  to  be  reasonably  accurate,  the  value 
of  A  must  be  kept  near  1.  The  Q-coder  therefore  renormalized  by  doubling  both 
A  and  C  whenever  the  value  of  A  fell  below  0.75.  (The  actual  arithmetic  was 
performed  on  scaled  integers  with  implied  binary  points.)  Since  Q  is  always  0.5 
or  less,  renormalization  always  occurred  after  the  encoder  or  decoder 
encountered  an  LPS;  it  occurred  occasionally  when  an  MPS  was  encountered. 

The  probability  estimator  took  the  form  of  a  finite  state  machine.  Such  a 
machine  has  the  property  that  its  next  state  depends  upon  its  present  state  and 
its  present  inputs.  In  the  original  Q-coder,  the  present  state  (for  each  context) 
combined  (1)  an  index  to  a  table  of  estimated  LPS  probabilities,  and  (2)  the 
current  value  of  the  MPS.  Each  table  record  contained,  not  only  the  LPS 
probability  estimate  for  that  state,  but  also  two  "next  state"  (next  index)  values, 
one  for  when  the  encoded  (decoded)  symbol  was  the  MPS  and  one  for  the  LPS. 
When  the  state  was  such  that  the  MPS  probabilities  were  both  (very  near)  0.5 
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and  an  LPS  was  encountered,  then,  instead  of  moving  to  a  new  table  index,  the 
estimator  simply  switched  the  value  of  the  MPS. 

The  memory  allocated  to  each  context  contained  the  current  MPS  value  and 
the  table  index  for  that  context.  Whenever  the  encoder  (decoder)  renormalized 
(doubled  A  and  C  and  delivered  (acquired)  a  byte  of  coded  data  if  a  new  byte 
was  ready),  the  context  was  updated  with  a  new  table  index  or  a  new  value  of 
the  MPS. 

The  JPEG  (not  JBIG)  is  an  extension  of  the  original  Q-coder.  Its 

LPS  probability  estimates  have  been  extended  to  15  bits  of  precision  to  support 
greater  skew.  The  coder  also  provides  for  swapping  the  MPS  and  LPS  sub¬ 
intervals  if  the  former  becomes  smaller  than  the  latter.  This  can  happen 
because  of  the  multiplication  approximations  when  A  is  at  or  near  its  minimum 
allowed  value  of  0.75,  and  the  estimated  LPS  probability,  Q,  is  near  0.5.  Since 
this  swapping  can  occur  only  during  renormaiization,  it  adds  littie  computational 
burden  in  the  MPS  path  which,  by  definition,  is  more  probable  than  the  LPS 
path.  Detailed  flow  diagrams  of  the  JPEG  coder  are  given  in  the  reference. 

JBIG  document  N213-R0,  reference  cited  above,  describes  the  fast-attack 
enhancement  of  this  JPEG  coder.  Two  "fast-attack  iadders"  are  added  to  the 
main  "state  machine."  The  first  ladder  is  entered  initially.  The  state  transfers 
to  the  second  ladder  when  the  first  LPS  occurs,  or  a  series  of  MPS’s  carry  the 
state  off  the  iadder.®  The  state  transfers  from  the  second  ladder  to  the  main 
state  machine  when  a  second  LPS  occurs  or  a  series  of  MPS’s  carries  the  state 
off  that  iadder.  Once  the  state  enters  the  main  state  machine,  it  remains  there 


If  the  very  first  symbol  in  »  given  context  is  the  LPS,  the  state  remains  the  same  except 
for  a  switch  of  the  MPS  value. 
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for  the  balance  of  the  encoding  (decoding)  task.  The  fast-attack  ladders  serve 
to  bring  the  probability  estimates  for  any  given  context  into  the  "right  ball 
park"  with  very  few  "hits"  on  that  context. 

As  this  report  was  being  written,  a  new  revision  of  the  JPEG  specification 
was  released^^°'  This  describes  a  further  refinement  of  the  0-coder,  including 
the  fast-attack  ladders.  Carries  are  handled  without  bit  stuffing,  but  all-ones 
bytes  are  followed  by  all-zeros  bytes  to  allow  for  marker  codes.  It  is 
anticipated  that  the  ultimate  JBIG  and  JPEG  standards  will  specify  the  same 
binary  arithmetic  coder. 
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3.0  THE  STUDY  SYSTEM 
3.1  Overview 

The  study  system  simulates  the  JBIG  Base  System  as  defined  in  March, 
1990,  with  the  exception  that  the  study  system  simulates  the  progressive 
transmission  mode  only.  The  insights  gained  and  conclusions  drawn  from  the 
simulations  would  have  been  negligibly  different  had  both  progressive  and 
sequential  transmissions  been  simulated. 

The  documents  on  which  the  major  components  of  the  study  system  are 
based  are: 

PRES  JBIG  Document  N198 

TP  JBIG  Documents  N130  and  N131 

OP  JBIG  Documents  N199  and  N141 

Context  templates  JBIG  Document  N174 

AT  algorithm  JBIG  Documents  N139-R2  and  H203-RO 

BAG  Document  X3L2.8/90  -  004  and  JBIG  N213-R0 

The  study  system  was  written,  not  merely  to  repeat  simulations  performed 
by  others,  but  to  obtain  additional  data  that  may  influence  the  final  standard. 
These  data  are  specifically:  (1)  the  optimum  number  of  Image  reductions, 
sometimes  none,  to  maximize  overall  compression,  and  (2)  the  contributions  o'  the 
major  system  components,  namely  TP,  OP  and  binary  arithmetic  coding  ('.ncludirg 
AT),  to  data  compression. 

Recent  proposals  flowing  from  the  ongoing  JBIG  investigations,  but  not 
implemented  in  the  study  system,  include: 

Starting  Layer  Algorithm  with  Optima!  Templates 

Described  in  JBIG  Document  N190  Rev.  1^^’^  the  Starting  Layer  Algorithm 
transmits  a  given  image  layer  directly,  without  the  benefit  of  a  reference  layer, 
without  TP  or  DP,  and  with  optimal  context  templates  different  from  those 
specified  in  JBIG  Dccu.ment  N174.  The  stud/  system  has  the  option  of  simulating 
this  algorithm  by  treating  any  image  layer  as  the  base  layer,  but  employs  the 
base  layer  template  described  in  JBIG  document  N174.  Because  the  Starting 
Layer  Algorithm  does  not  employ  any  prediction.  It  is  much  simpler  than  the 
general  algorithm,  and  is  a  serious  contender  for  the  JBIG  standard. 
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At  In  Transmitter  Only 

JBIG  Document  N204-R0,  reference  cited,  specifies  that  AT  processing  be 
done  only  in  the  transmitter.  Template  changes  are  transmitted  vjhen  necessary; 
they  are  so  infrequent  that  the  overhead  is  negligible. 

Pure  Binary  Arithmetic  Coder  Output 

The  JPEG/Fast-Attack  coder,  described  in  JBIG  Document  N213-R0  and 
JPEG  Document  X3L2/90  -  004  (references  cited),  manages  carries  by  bit 
stuffing.  "Pure"  arithmetic  coder  output  refers  to  a  bit  stream  that  exactly 
represents  the  binary  fraction,  C,  to  the  precision  required  for  decoding.  Pure 
output  relieves  the  decoder  of  the  need  to  test  for  and  process  stuff  bits  or 
other  special  data  required  to  cope  with  the  carry  problem. 

JBIG  document  proposes  a  method  of  generating  pure  output.  It 

consists  of  counting  consecutive  bytes  of  all  1’s  instead  of  delivering  them  to 
the  bit  stream.  When  a  byte  *’!at  is  not  all  1’s  is  encountered,  the  previous 
such  byte  is  still  in  the  output  buffer,  but  has  not  yet  been  transmitted. 

If  it  is  determined  that  a  carry  is  being  propagated  into  a  string  of  all-1 
bytes,  it  is  added  to  the  byte  preceding  the  all-1  bytes,  and  then  as  many  bytes 
of  ail  O’s  are  output  as  there  were  ali-1  bytes.  If  no  carry  is  being  so 
propagated,  then  the  counted  number  of  all-1  bytes  are  output.  In  either  case, 
after  the  required  number  of  bytes  are  output,  the  counter  is  reset  to  0. 

A  32-bit  counter  would  never  overflow  with  an  image  of  any  practical  size, 
even  if  all  the  byies  were  all  I’s,  the  probability  of  which  is  minuscule.  The 
reference  states  that  the  all-1 ’s  byte  count  never  exceeded  4  for  any  image 
processed. 

The  pure  output  is  implemented  in  the  May  1990  JPEG  specification  (JPEG- 
8-R5.2,  reference  cited),  but  each  byte  of  all  ones  is  followed  by  one  of  all  zeros 
to  allow  the  insertion  of  marker  codes  into  the  code  stream.  This  is  called  byte 
stuffing.  In  an  environment  that  never  uses  marker  codes,  byte  stuffing  would 
be  unnecessary. 

The  study  system  consists  of  three  program:  PRES2,  JBENCODE  and 
JBDECODE.  The  system  produces  compression  statistics  and  counts  the  numbers 
of  TP  and  DP  pixels  in  difference  layers  to  determine  the  contributions  of  the 
system  components  to  data  compression. 
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During  development  and  testing,  each  program  produced  intermediate 
printouts  so  that  the  innermost  workings  of  the  algorithm  could  be  monitored. 
Special  small  test  "images"  were  contrived  to  test  all  aspects  of  every  part  of 
the  system.  The  intermediate  printouts  were,  of  course,  .oabled  during 
production  simulations. 

3.2  Programs 

3.2.1  Program  PRES2 
Purpose 

To  reduce  an  image  to  half  its  size  in  both  directions.  If  either  dimension 
is  odd,  the  reduced  image  dimension  is  rounded  up.  The  name  PRES2  arises 
because  the  original  PRES  program  performed  the  actual  computations  and 
exception  searches;  PRES2  employs  a  look-up  table.  PRES2  is  executed 
repeatedly  to  produce  images  of  successively  lower  resolution,  each  execution 
performing  one  reduction. 

Inputs 

PRES  look-up  table  file  name 
File  name  of  image  to  be  reduced 
Reduced  image  file  name 

Width  and  height  in  pixels  of  image  to  be  reduced 
Outputs 

Reduced  image 

Printout  of  the  reduced  image  dimensions 

3.2.2  Program  JBENCODE 
Purpose 

To  simulate  the  transmission  of  one  image  layer,  difference  or  base. 

Inputs 

File  name  of  image  layer  being  encoded 
Base  or  difference  layer  option 

File  name  of  reference  layer,  not  applicable  to  base  layer 
Prediction  option,  not  applicable  to  base  layer 
DP  rules  file  name  if  DP  is  employed 
Compressed  data  file  name 
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Encoder  statistics  file  name  (print  file) 

Option  to  enable  AT 

Width  and  height  (pixels)  of  image  being  encoded 

Width  and  height  of  reference  layer,  not  applicable  to  base 

layer 

The  prediction  option  is  TP,  DP  or  both  (TP  tried  first),  and  is  not  applicable  to 
the  base  layer.  The  AT  option  is  valid  for  base  or  difference  layer  encoding. 

The  JBIG  Base  System  is  simulated  by  invoking  both  TP  and  DP  and  by  enabling 
AT. 

Outputs 

Compressed  data  file 
Statistics  print  file 

The  statistics  print  file  gives  the  simulation  results.  In  addition  to  the 
compressed  data  bit  count,  it  contains  other  data  designed  to  give  insight  into 
what  the  JBIG  algorithm  does.  These  statistics  are  described  in  Section  4.3. 

3.2.3  Program  JBDECODE 
Purpose 

To  simulate  the  reception  of  one  image  layer,  difference  or  base. 

Inputs 

The  inputs  are  the  same  as  for  JBENCODE  except  that  the  first  input  is 
the  name  of  the  file  where  the  decoded  image  is  to  be  written,  and  the  statistics 
file  name  specifies  where  the  decoder  statistics  are  to  be  placed.  The  options 
must,  of  course,  match  those  invoked  in  the  encoder  simulation  to  ensure 
accurate  decoding  of  the  image. 

Outputs 

Decoded  image 
Decoder  statistics 

The  decoder  statistics  are  much  less  elaborate  than  those  produced  by  the 
encoder,  since  the  decoder  duplicates  what  the  encoder  does.  The  decoder 
statistics  were  designed  merely  to  check  the  software. 

Because  the  JBIG  algorithm  produces  lossless  image  transmission,  the  only 
purpose  of  simulating  the  decoder  was  to  verify  the  softwcre  integrity  of  the 
encoder.  During  development  and  testing,  the  decoder  was  always  simulated.  In 
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production  simulations  the  decoder  was  simulated  only  occasionally  to  give  spot 
checks. 
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4.0  SIMULATIONS 

4.1  Simulation  Objectives 

The  simulation  objectives  were;  (1)  to  prove  that  the  study  system 
simulates  the  JBIG  Base  System  algorithm,  and  (2)  to  accumulate  and  document 
data  not  published  in  any  JBIG  literature.  Specifically,  these  data  are:  (1)  the 
total  compression  as  a  function  of  the  number  of  reductions,  and  (2)  the  data 
compression  contributed  by  the  major  parts  of  the  algorithm  (TP,  DP,  the  binary 
arithmetic  coder,  etc.)  during  difference  layer  processing.  These  data  could 
have  considerable  influence  on  the  compromise  between  data  compression  and 
system  complexity. 

4.2  Test  Images 

The  test  images  are  referred  to  as  the  Stockholm  images  in  most  JBIG 
documents.  Table  4.1  summarizes  these  images.^^^^  The  starting  image 
resolutions  are  400  dots  per  inch;  five  PRES2  reductions  produce  images  having 
resolutions  of  200,  100,  50,  25  and  12.5  dots  per  inch.^ 

4.3  Encoder  Simulation  Statistics 

The  encoder  simulator,  JBENCODE,  reports  the  following  statistics: 
o  Number  of  compressed  data  bits 
o  Number  of  TP  pixels  (difference  layers  only) 
o  Number  of  DP  pixels  (difference  layers  only) 
o  Number  of  bits  sent  to  the  binary  arithmetic  coder 
o  Number  of  TP  exception  lines  (if  TP  is  invoked) 
o  Number  of  floating  pixel  changes  (if  AT  is  invoked) 

The  encoder  simuiator  reports  the  image  line  number  where  each  floating  pixel 
change  occurs. 

To  give  insight  into  the  performance  of  the  binary  arithmetic  coder  in  the 
JBIG  environment,  the  encoder  simulator  periodically  produces:  (1)  a  histogram 
showing  the  number  of  contexts  lying  in  various  ranges  of  number  of  "hits," 


Actually,  PRES2  halves  the  number  of  pixels  per  line  and  number  of  lines.  The 
"resolutions"  would  be  those  observed  were  the  reduced  images  enlarged  to  the  starting 
size. 
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No. 


Type 


Dimensions 
width  X  height 


Source 


1 

Letter  type 

3072  X  4352 

Eastman  Kodak  USA 

2 

Roman  type  and  figures 

3072  X  4352 

Eastman  Kodak  USA 

3 

Japanese  news  paper 

3072  X  4352 

Eastman  Kodak  Japan 

4a 

Halftone  image  8x8  dither 

3072  X  2048 

Eastman  Kodak  USA 

4b 

Halftone  image  ERR  dif. 

3072  X  2048 

Eastman  Kodak  USA 

4c 

Half  tone  image  4x4  dither 

3072  X  2048 

Eastman  Kodak  USA 

4d 

Halftone  image  3x3  dither 

3072  X  2048 

Eastman  Kodak  USA 

5 

Hand  writing 

3072  X  4352 

Eastman  Kodak  Japan  &  USA 

6 

Mixed  document 

3072  X  4352 

Eastman  Kodak  USA 

■ 

Line  drawing  scanned 

3072  X  4352 

Eastman  Kodak  USA 

- 1 

8 

Line  drawing  generated 

3072  X  3040 

Delta  Information  Systems  USA 

9 

Error  diffused  image 

1024 X 1024 

Eastman  Kodak  USA 

10 

Computer  generated  line 

4096  X  5856 

Delta  Information  Systems  USA 

Table  4.1  Test  Image  Descriptions 
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and  (2)  a  weighted  average  of  the  context  indices  into  the  probability  estimator 
table.  The  higher  the  index  for  any  given  context,  the  greater  the  MPS/LPS 
probability  skew,  and  hence  the  better  the  compression  contributed  by  that 
context.  The  weighting  factor  is  the  number  of  "hits"  on  the  context.  While 
contexts  with  very  few  hits  give  less  compression  than  those  with  more  (because 
their  probability  estimates  are  less  accurate),  they  contribute  to  the  bit  stream 
less  often,  and  so  the  overall  compression  degradation  is  slight. 

4.4  Procedure 

The  foilowing  steps  were  performed  for  each  test  image: 

1.  Denote  the  original  image  as  layer  RO  (zero  reductions) 

2.  Execute  PRES2  5  times  to  produce  layers  R1,  R2,  R3,  R4 
and  R5 

3.  Execute  JBENCODE  to  encode  all  6  layers  as  if  they  were 
base  layers 

4.  Execute  JBENCODE  to  encode  layers  RO  through  R4  as 
difference  layers 

A  few  layers  of  a  few  images  were  decoded  to  check  the  encoder  software 
integrity. 

Ail  production  simulations  were  performed  with  the  system  parameters  set 
to  simulate  the  JBIG  base  system.  However,  in  a  few  tests  to  evaluate  the 
contribution  of  AT  to  data  compression,  all  prediction  was  disabled,  and  separate 
simulations  were  performed  with  AT  enabled  and  disabled.  The  results  of  all 
simulations,  including  these  specisi  tests,  are  presented  in  the  next  section. 
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5.0  RESULTS 

5.1  General  Discussion 

5.1.1  Comparison  of  Study  and  JBIG  Base  Systems 

To  prove  that  the  study  system  simulates  the  JBIG  Base  System  algorithm, 
a  comparison  was  made  between  simulation  results  reported  in  JBIG  Document 
N213-R0  (reference  cited)  and  those  obtained  with  the  study  system.  These  will 
be  called  the  reference  and  study  results  respectively. 

The  reference  results  were  taken  from  Table  3A  of  the  reference 
document.  The  simulation  parameters  v/ere:  PRES  +  TP  +  DP  +  AC  (now  AT), 
without  stripes  (i.e.  progressive  transmission),  with  minimum  termination  (of  the 
binary  arithmetic  coder),  and  v/ithout  special  codes.  The  reference  and  study 
simulators  employ  the  JPEG/FA  coder  as  specified  in  JPEG  document  X3L2.8/90  - 
004,  with  the  enhancement  for  fast  attack  added  as  described  in  JBIG  Document 
N213-R0,  both  references  cited. 

Both  sets  of  data  apply  to  full  progressive  encoding  of  the  original  400 
dots-per-inch  images,  namely;  (1)  PRES  reducing  these  images  to  produce  images 
having  resolutions  of  200,  100,  ...»  12.5  dots  per  inch,  and  (2)  encoding  the  12.5 
dots-per-inch  images  as  base  layers  and  the  remaining  images  as  difference 
layers.  The  study  and  reference  parameters  are  identical,  except  that  in  the 
study  simulations  the  bytes  required  to  produce  an  end-of-image  marker  code 
are  included  in  the  byte  count. 

Table  5.1  shows  the  results.  Column  1  gives  the  image  numoer,  column  2 
shows  the  total  compressed  data  byte  count  cited  by  the  reference  for  the 
entire  progression,  column  3  cites  the  total  byte  count  reported  by  the  study 
system,  column  4  shows  the  difference  between  the  study  and  reference  byte 
counts  (study  minus  reference)  and  column  5  gives  this  difference  as  a 
percentage  relative  to  the  reference  byte  count. 

For  all  test  images  the  results  agree  to  within  a  few  tens  of  bytes  out  of 
many  thousands,  and  in  some  cases  over  200  thousand  bytes  of  compressed  data. 
It  is  thus  apparent  that  the  study  system,  for  practical  purposes,  exactly 
simulates  the  JBIG  Base  System  algorithm  as  defined  in  March,  1990. 
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Image 

No. 

Reference 

(bytes) 

Study 

(bytes) 

Difference 

(bytes) 

Difference 

(percent) 

1 

13761 

13771 

+10 

+0.073 

2 

16026 

16041 

+15 

+0.094 

3 

149207 

149219 

+12 

+0.008 

4a 

91344 

91355 

+11 

+0.012 

4b 

113172 

113163 

-9 

-0.008 

4c 

69258 

69271 

+13 

+0.019 

4d 

113531 

113541 

+10 

+0.009 

5 

33370 

33384 

+14 

+0.042 

6 

228109 

228080 

-29 

-0.013 

7 

24850 

24865 

+15 

+0.060 

8 

7171 

7191 

+20 

+0.279 

9 

99984 

99996 

+12 

+0.012 

10 

17236 

17246 

+10 

+0.058 

Table  5.1  Comparison  of  Reference  and  Study  Results 
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5.1.2  Simulation  of  Partial  Progressions 

Partial  progressions  were  simulated  to  determine  the  optimum  number  of 
image  reductions  to  minimize  the  total  number  of  transmitted  bytes.  Layer 
numbers  RO,  R1,  R5  denote  the  number  of  reductions,  0,  1,  5,  required  to 

produce  those  iayers.  Let  be  the  number  of  compressed  data  bits  required 
to  transmit  layer  Ri  as  a  base  layer,  and  -  the  number  to  transmit  the  same 
layer  as  a  difference  layer.  (Layer  R5  is  always  transmitted  as  a  base  layer, 
since  there  is  no  layer  R6.)  Then  the  total  number  of  bits,  T(k),  required  for  a 
partial  progression  involving  k  image  reductions  is: 


T(0)  = 

T(k)  =  ^d.k-'’  ^d,0  ^ 

"he  optimum  number  of  reductions  for  minimum  total  compressed  data  is  that  k 
for  which  T(k)  is  minimum. 

These  tests  included  encoding  the  starting  image  as  a  base  layer,  as  in 
the  Starting  Layer  Algorithm  described  in  JBIG  Document  190,  Rev.  1  (reference 
cited),  except  that  the  same  base  layer  template  (JBIG  Document  Ml 74)  was 
employed  regardless  of  which  image  layer  was  treated  as  the  base  layer. 

I’able  5.2  summa^  'zes  the  results  of  simulating  these  partial  progressions. 
The  table  shows,  for  each  image,  the  optimum  number  of  reductions,  including 
'  One,  for  minimum  total  compressed  data,  and  the  compression  ratio  (number  of 
starting  image  pixels  to  total  number  of  compressed  data  bits)  achieved  for  the 
optimum  number  of  reductions. 

Data  presented  later  show,  for  each  image,  the  total  number  of  compressed 
data  bytes  as  a  function  of  the  number  of  reductions.  The  optimum  numbers 
were  C  in  the  two  error  diffused  images  and  in  the  3x3  dithered  image.  In  most 
of  the  remaining  images  the  optima  were  1;  in  a  few,  2;  and  the  optimum  never 
exceeded  2, 


5.1.3  Contributions  to  Data  Compression 

In  base  layer  processing,  data  compression  is  produced  solely  by  the 
binary  arithmetic  coder  in  concert  with  the  adaptive  context  templates  (AT).  In 
difference  layer  processing,  however,  TP  and  DP  also  contribute  significantly. 


’stolen "  from  that  of  the 


arithmetic  coder.  That  is,  if  either  TP  or  DP  or  both  were  not  used,  the 


5-3 


Image 

No. 

Optimum  Number 
of  Reductions 

Compression 

Ratio 

1 

1 

112.2 

2 

1 

111.7 

3 

2 

11.5 

4a 

1 

9.9 

4b 

0 

9.6 

4c 

1 

12.6 

4d 

0 

11.0 

5 

2 

51.4 

6 

1 

CO 

7 

2 

69.7 

8 

in 

180.2 

9 

0 

1.6 

10 

2 

177.5 

(*)  Tied  with  2 


Table  5.2  Optimum  Number  of  Reductions  and  Maximum  Compression  Ratio 
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arithmetic  coder  would  make  up  much  of  the  compression  that  was  lost.  The 
relative  contribution  of  each  (TP  or  DP)  depends  upon  whether  the  image 
rendition  is  drawing/text  or  half-tone.  In  the  next  section  the  contributions  of 
TP,  DP  and  the  coder  are  shown  in  pie  chart  form  for  each  test  image. 

In  drawings  and  text,  including  hand  writing,  TP  accounted  for  nearly  all 
the  compression  in  the  higher-resolution  layers,  the  binary  arithmetic  code'* 
provided  most  of  the  rest,  and  DP  contributed  negligibly.  For  half-tone  images, 
the  binary  arithmetic  coder  accomplished  most  of  the  compression,  DP  added  a 
significant  amount,  and  the  TP  contribution  was  negligible. 

5.2  Detailed  Discussion 

Figures  5.1  through  5.26  show  detailed  results,  a  pair  of  figures  applying 
to  each  test  image.  The  first  figure  in  each  pair  shows  the  total  number  of 
compressed  data  bytes  as  a  function  of  the  number  of  image  reductions. 

The  second  figure  in  the  pair  shows  one  pie  chart  for  each  difference 
layer,  giving  the  distribution  of  image  bits  over:  bits  not  encoded  because  of 
T3  and  DP,  encoded  bits  saved  by  the  binary  arithmetic  coder,  and  bits  actually 
transmitted.  The  whole  pie  represents  the  total  number  of  pixels  in  the 
difference  iayer,  plus  the  number  of  lines  in  the  reference  layer  used  while 
encoding  the  difference  iayer.  The  latter  number  is  the  number  of  TP  exception 
bits,  which  are  encoded  in  their  own  context,  one  per  reference  image  line. 

Thus,  the  whole  pie  represents  the  total  number  of  bits  processed  for  one  image 
iayer. 

Showing  the  contribution  of  Adaptive  Templates  (AT)  in  the  pie  charts  was 
not  feasible,  because  AT  is  an  integral  part  of  the  binary  arithmetic  coding 
process.  Document  JBIG  N133-R2  (reference  cited)  states  that  AC  (AT)  can 
sometimes  halve  the  byte  count  in  images  in  which  it  is  most  advantageous.  A 
spot  test  on  layer  R1  of  Image  4c  showed  a  9.7  percent  reduction,  and  on  layer 
R1  of  Image  4d  a  21.8  percent  reduction.  These  two  image  layers  had  already 
been  determined,  during  earlier  simulations,  to  experience  a  floating  pixel 
change  early  in  the  image  scan.  The  comparisons  were  obtained  by  encoding  the 
tv/o  layers  as  base  layers  with  and  without  AT.  The  base  iayer  mode  was 
selected  so  that  prediction  would  not  contribute  to  the  compression. 

Figures  5.1  and  5.2  give  the  simulation  results  for  Image  1,  a  text  image. 
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The  optimum  number  of  reductions  was  1,  and  the  byte  count  was  13  percent 
below  that  for  no  reduction.  TP  accounted  for  most  of  the  data  compression,  the 
binary  arithmetic  coder  provided  most  of  the  rest,  and  DP  added  a  small  amount. 

Figures  5.3  and  5.4  apply  to  Image  2,  which  contains  text  and  drawings. 
Again,  the  optimum  number  of  reductions  was  1,  with  the  byte  count  S  percent 
below  that  for  no  reduction.  TP  accounted  for  almost  all  of  the  compression. 

Figures  5.5  and  5.6  present  the  results  for  Image  3,  Japanese  writing. 

The  ucmpression  for  this  image  was  an  order  of  .magnitude  worse  than  •'or  the 
first  two  images.  The  optimum  number  of  reductions  was  2,  but  the  difference 
between  2  and  1  was  less  than  one  percent.  The  minimum  byte  count  was  10 
percent  beiow  that  for  no  reduction.  TP  accounted  for  most  of  the  compression 
in  layers  RO  and  R1,  with  the  binary  arithmetic  coder  contributing  most  of  the 
rest,  and  DP  becoming  significant  in  the  lower  resolution  layers. 

Images  4a,  4b,  4c  and  4d  are  half-tone  renditions  of  the  same  subject 
(sails),  but  were  produced  by  different  methods.  Figures  5.7  and  5.8  provide 
the  data  for  Image  4a,  where  the  dither  is  3x8.  The  optimum  number  of 
reductions  was  1,  with  a  10  percent  reduction  in  byte  count  with  respect  to  no 
reduction.  TP  was  minimal  in  all  layers,  DP  contributed  significantly,  but  the 
binary  arithmetic  coder  contributed  by  far  the  most  to  the  data  compression. 

Results  for  error  diffused  half-tone  Image  4b  are  shown  in  F’gures  5.9 
and  5.10.  The  optimum  number  of  reductions  was  0  (i.e.  base  layer  algorithm), 
with  one  reduction  requiring  12  percent  more  bytes  than  none.  TP  was 
practically  non-existent  in  all  layers,  DP  contributed  some  compression,  but,  as 
n  Image  4a,  the  binary  arithmetic  coder  produced  most  of  the  compression. 

The  results  for  Image  4c,  with  4x4  dither,  are  depicted  in  Figures  5.11  and 
5.12.  The  optimum  number  of  reductions  was  1,  and  the  improvement  with 
respect  to  none  was  19  percent.  In  layer  RO,  the  binary  arithmetic  coder 
provided  most  of  the  compression,  DP  contributed  substantially,  and  TP  was 
negligible.  DP  decreased  and  TP  increased  in  lower  resolution  laye'^s,  but  the 
binary  arithmetic  coder  contributed  more  compression  than  TP  and  DP  combined 
in  all  layers. 

Figures  5.13  and  5.14  show  the  results  for  Image  4d,  produced  by  a  3x3 
dither.  The  optimum  number  of  reductions  was  none,  with  one  reduction 
producing  a  total  byte  count  42  percent  greater  than  none,  a  ve  •  significant 
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difference.  TP  was  negligible  in  all  layers,  DP  contributed  about  the  same 
amount  in  each  layer,  and  the  binary  arithmetic  coder,  as  usual  for  half-tore 
images,  contributed  most  of  the  data  compression. 

The  results  for  Image  5,  hand  writing,  are  given  In  Figures  5.15  and  5.16. 
The  optimu.m  number  of  reductions  was  2,  with  13  percent  rrr.provement  with 
respect  tc  none,  and  ^  percent  with  respect  to  one  reduction.  TP  contributed 
the  lion’s  share  of  the  compression  in  as!  layers,  the  binary  arithmetic  coder 
providing  most  of  the  rest,  and  DP  adding  a  small  amount. 

Figures  5.17  and  5.18  show  the  results  for  Inage  5,  a  collage  of  German 
text,  white  on  black  and  black  on  white,  and  small  half-tone  pictures.  The 
cpt.mum  number  of  reductions  was  1,  with  a  total  byte  count  15  percent  below 
that  "for  none.  In  layer  RO,  T.p  and  DP  combined  contributed  roughly  as  much  to 
the  compression  as  the  binary  arithmetic  coder,  TP  much  more  than  DP.  For 
lower -resolution  layers,  the  coder  contributed  most  of  the  compression.  With 
jecreasing  resoiut.on,  the  importance  of  TP  decreased  sharply;  that  of  DP  varied 
slightly. 

For  Image  7,  a  scanned  line  drawing.  Figure  5.13  shows  that  the  optimum 
'' umber  of  reductions  was  2.  The  byte  total  was  7  percent  less  thar  that  for  tk: 
reduction,  and  1  percent  less  than  for  one  reduction.  The  pie  charts  cf  Figure 
3.2Q  show  that  TP  dominated  the  compression,  the  coder  provided  .most  of  the 
rest,  and  DP  contributed  slightly. 

Figures  5.21  and  5.22  apply  to  Image  8,  a  generated  'ine  drawing.  It  is 
interesting  to  note  that  the  computer  generated  drawings  (images  8  and  10) 
gave  roughly  2.5  times  the  compression  ratio  as  the  scanned  drawing  of  Image  7. 
For  image  S,  the  optimum  number  of  .'■eductions  was  an  exact  tie  between  1  and 
2.  The  minimum  byte  count  was  10  percent  below  that  for  no  reduction,  .s.gain, 
TP  contributed  to  most  of  the  compression,  with  the  coder  providing  most  of  the 
rest,  and  DP  adding  a  slight  a.mount. 

Image  9,  another  error  diffused  image,  gave  results  shown  in  Figures  5.23 
and  5.24.  As  in  the  other  error  diffused  image  and  the  3x3  dither,  the  optimum 
number  of  reductions  was  0,  and  one  reduction  required  15  percent  more  total 
compressed  data  bytes  than  did  none.  TP  was  virtually  nonexistent,  DP 
contributed  roughly  the  same  amount  of  compression  in  ail  layers,  and  the  coder 
provided  the  most  in  all  layers  except  R4, 
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Finally,  Figures  5.25  and  5.26  show  the  results  for  Image  10,  another 
computer  generated  "line  drawing"  (actually  a  pie  and  a  bar  chart  with  ^mall 
circles,  grids  and  hatches  used  for  fill).  The  optimum  number  of  reductions  was 
2,  with  a  13  percent  reduction  in  total  byte  count  with  respect  to  no  ^action, 
and  3  percent  with  respect  to  one  reduction.  TP  provided  nearly  all  of  the 
compression  in  the  higher  resolution  layers,  the  coder  produced  most  of  the 
rest,  with  DP  contributing  slightly. 
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Figure  5.1  Total  Number  of  Bytes  Transmitted  vs.  Number  of  Reductions  for  Test  Image  1 
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Figure  5,2  Distribution  of  Data  Bits  for  Test  Image  1  Difference  Layers 
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Figure  5.3  Total  Number  of  Bytes  Transmitted  vs.  Number  of  Reductions  for  Test  Image  2 
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Figure  5.4  Distribution  of  Data  Bits  for  Test  Image  2  Difference  Layers 


Total  Number  of 
Bytes  Transmitted 


180000 

160000 

140000 

120000 

100000 

80000 

60000 

40000 

20000 

0 


0 


Number  of  Reductions 


Figure  5.5  Total  Number  of  Bytes  Transmitted  vs.  Number  of  Reductions  for  Test  Image  3 
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Figure  5.6  Distribution  of  Data  Bits  for  Test  image  3  Difference  Layers 
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Figure  5.7  Total  Number  of  Bytes  Transmitted  vs.  Number  of  Reductions  for  Test  Image  4a 
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Figure  5.8  Distribution  of  Data  Bits  for  Test  Image  4a  Difference  Layers 
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Figure  5.9  Total  Number  of  Bytes  Transmitted  vs.  Number  of  Reductions  for  Test  Image  4b 
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Figure  5.10  Distribution  of  Data  Bits  for  Test  Image  4b  Difference  Layers 
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Figure  5.1 1  Total  Number  of  Bytes  Transmitted  vs.  Number  of  Reductions  for  Test  Image  4c 
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Figure  5.12  Distribution  of  Data  Bits  for  Test  Image  4c  Difference  Layers 
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Rgure  5.1 3  Total  Number  of  Bytes  Transmitted  vs.  Number  of  Reductions  for  Test  Image  4d 
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Figure  S.14  Distribution  of  Data  Bits  for  Test  Image  4d  Difference  Layers 
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Figure  5.1 5  Total  Number  of  Bytes  Transmitted  vs.  Number  of  Reductions  for  Test  Image  5 


5-23 


Layer  RO 


Layer  R1 


Layer  R2 


Layer  R3  Layer  R4 


Legend 

I  I  Bits  Saved  by  Typical  Prediction 

Bits  Saved  by  Deterministic  Prediction 
Imil  Bits  Saved  by  Entropy  Coder 
llllll  Bits  Transmitted 


Figure  5.16  Distribution  of  Data  Bits  for  Test  Image  5  Difference  Layers 
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Figure  5.17  Total  Number  of  Bytes  Transmitted  vs.  Number  of  Reductions  for  Test  Image  6 
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Figure  5.18  Distribution  of  Data  Bits  for  Test  Image  6  Difference  Layers 
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Figure  5.1 9  Total  Number  of  Bytes  Transmitted  vs.  Number  of  Reductions  for  Test  Image  7 
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Figure  5.20  Distribution  of  Data  Bits  for  Test  Image  7  Difference  Layers 
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Figure  5.21  Total  Number  of  Bytes  Transmitted  vs.  Number  of  Reductions  for  Test  Image  8 
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Figure  5.22  Distribution  of  Data  Bits  for  Test  Image  8  Difference  Layers 
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Rgure  5.23  Total  Number  of  Bytes  Transmitted  vs.  Number  of  Reductions  for  Test  Image  9 
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Figure  5.24  Distribution  of  Data  Bits  for  Test  Image  9  Difference  Layers 
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Figure  5.25  Total  Number  of  Bytes  Transmitted  vs.  Number  of  Reductions  for  Test  Image  10 
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Figure  5.26  Distribution  of  Data  Bits  for  Test  Image  10  Difference  Layers 
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6.0  CONCLUSIONS 


The  information  gleaned  by  simulating  the  JBIG  Base  System  algorithm  sn 
the  progressive  mode  and  by  performing  partial  progressions  with  fev/er  than  5 
image  reductions,  including  none,  leads  to  the  follov/ing  conclusions. 

Typical  prediction  (T?),  deterministic  prediction  {DP‘  and  adaptive  context 
templates  (AT)  ail  contribute  to  compression,  some  more  than  others,  dependi-ng 
upon  resolution  and  whether  or  not  the  image  is  half-tone.  TP  ccntributes  the 
most  compression  in  line  drawings  and  text  images,  including  hand  wr  tiig,  but 
is  of  negligible  importance  in  half-tone  images.  D.p  and  AT  both  contribute 
sicni'icantly  in  half-tone  images,  but  negligibly  in  drawings  and  text.  Note  that 
these  compression  results  are  based  on  the  application  of  all  parts  (TP,  09,  AT) 
cf  the  algorithm.  This  does  not  mean  that  similar  overall  compression  could  not 
be  achieved  by  omitting  part  (i.e.  T?)  and  making  up  the  com.pression  with 
another  part  (i.e.  the  arithmetic  coder).  However,  to  achieve  the  highest 
ccmp.'‘ession  n  a  progressive  system  designed  to  be  independent  of  image  type, 
TP  and  OP  should  both  be  implemented  in  difference  layer  processing,  and  AT 
should  be  implemented  in  difference  and  base  layer  processing,  at  least  if  the 
present  JBIG  templates  are  employed.- 

Th.e  binary  arithmetic  coder  greatly  reduces  the  number  of  transmitted 
pixels  with  respect  to  those  which  must  be  encoded  (are  neither  IP  .nor  DP; 
regardless  of  image  type;  hence,  it  is  concluded  that  the  JBIG  context  templates 
are  well  chosen.  This  last  conclusion  was  further  substantiated  by  observing 
the  coder  LPS  probability  estimates,  averaged  over  the  contexts,  at  various 
points  during  and  at  the  end  of  each  coding  task.  The  MPS/LPS  probability 
skew  rapidly  becomes  high  and  reaches  roughly  a  steady  state,  fluctuating 
slightly  with  varying  local  statistics  as  coding  progresses.  It  was  also  observed 
that  the  skew,  and  hence  the  compression  (of  the  encoded  data,  not  T?  or  DP 
pixels),  is  highest  in  the  highest  resolution  images,  and  decreases  as  the 
resolution  becomes  lower.  It  is  conjectured  that  this  is  because  PRES,  which 
seeks  to  preserve  detail  during  image  reduction,  removes  a  great  deal  of 
redundancy;  hence,  the  context  templates  become  poorer  predictors  as  resolution 


J3IQ  Occusenc  Ho.  190.  "Using  the  JBIG  Starting  Layer  Algoritha  for  All  Sesclutisns." 
reference  cited,  suggests  that  AT  can  be  eliminated  if  the  AT  lag  pixels  are  always 
included  in  the  context  template. 
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decreases. 

Whether  the  starting  layer  algorithm  can  compete  with  a  progression 
consisting  of  one  or  more  reductions  is  still  an  open  question.  For  400  dots  per 
inch  resolution,  the  starting  layer  algorithm  (with  the  current  JBIG  base  layer 
template)  gave  worse  compression  in  most  cases  tnan  a  progression  involving  at 
least  one  image  reduction.  Over  all  13  test  images,  at  least  one  reduction  gave 
an  average  percentage  decrease  in  total  byte  count  of  4.2  with  respect  to  no 
reduction.  When  at  least  one  reduction  was  better,  typical  improvements  were 
‘0  to  20  percent,  but  in  the  three  cases  in  which  no  reduction  was  best,  the 
degradations  incurred  by  performing  one  reduction  were  12,  l^  and  42  percent. 

The  starting  layer  algorithm  was  better  for  most  of  the  images  at 
resolutions  of  200,  and  always  better  at  resolutions  of  le~s  than  200  dots  per 
inch.  This  follows  directly  from  the  fact  that  the  total  byte  count  increased  in 
most  cases  when  the  number  of  redactions  evr^^ded  one,  and  always  when  the 
number  exceeded  two. 

The  computation  burden  of  the  JBIG  Base  Gystem  is  intense.  PRES  must 
examine  9  high-  and  3  low -resolution  pixels  for  each  low-resolution  pixel  it 
produces.  Proving  that  a  TP  cluster  exists  and  is  not  an  exception  requires 
examining  9  low-  and  4  high-resolution  pixels.  Testing  whether  one  pixel  is  CP 
requires  assc.nbling  into  a  state  an  average  of  10  other  pixels,  the  actual 
number  depending  upon  phase.  Determining  contexts  requires  assembling  7 
pixels  for  a  base  layer,  10  for  a  difference  layer,  per  pixel  to  be  encoded.  AT 
requires  maintaining  correlation  counts  for  7  AT  pixels  per  target  pixel. 

If  the  starting  layer  algorithm  can  be  made  to  perform  as  well,  or  almost 
as  well,  as  at  least  one  reduction,  then  PRES,  TP  and  DP  can  all  be  eliminated. 
(PRES  can  be  used  by  itself  to  perform  image  reduction  when  needed  for 
pu  rpc  ScS  other  than  transmitting  an  image  at  a  given  resolution.)  Use  of 
context  templates  that  always  include  the  present  AT  pixels  would  obviate  the 
need  for  AT.  It  is  therefore  recommended  that  the  starting  layer  algorithm  be 
vigorously  pursued,  at  least  for  simple  facsimile  transmission. 

For  data  base  storage,  browsing  and  retrieval,  with  various  resolutions 
from  icons  to  full-scale  images,  the  full  JBIG  algorithm  is  appropriate.  The  data 
base  system  could  employ  the  full  algorithm  to  decompress  and  expand  an  image 
to  any  desired  resolution,  and  then  use  the  starting  la'/er  algori'hm  to 
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recompress  and  transmit  the  expanded  image  to  a  user  site.  This  would  allow 
the  many  data  base  users  to  have  relatively  inexpensive  terminals  (including 
current  facsimile  machines),  with  the  data  base  system  having  the  "intelligence" 
required  to  store  and  retrieve  images  of  varying  resolution.  Similar 
functionality  could  provide  backward  compatibility  to  the  large  installed  base  of 
Group  2  facsimile  terminals.  If  the  JBIG  functionality  would  be  added  to 
terminals  that  had  softcopy  interactive  capability,  such  as  Group  4,  then  users 
of  such  terminals  would  realize  the  fuil  benefits  of  the  progressive  algorithm. 
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Appendix  A 


Principles  of  Binary  Arithmetic  Coding 

Binary  arithmetic  coders  are  based  on  the  principles  presented  I'ere.  The 
implementations  and  performances  of  specific  coders  are,  however,  widely  varied. 

Every  binary  arithmetic  coder/decoder  has  a  probability  estimator.  Assjme, 
for  simplicity,  only  one  "context,"  that  is,  at  any  given  time,  there  is  only  one 
probability  estimiate  based  on  past  binary  symbols  in  the  string  being  encoded 
or  decoded.  The  extension  to  more  than  one  context  is  accomplished  by  keeping 
separate  statistics  for  each,  as  described  in  the  body  of  this  report. 

Let  MPS  denote  that  binary  symbol  which  is  currently  estimated  to  be  the 
mere  probable;  let  LPS  denote  the  other,  less  probable,  binary  symbol.  Let  P 
denote  the  current  MPS  probability  estimate,  and  Q  the  current  LPS  probability 
estimate. 

Let  A  denote  a  probability  interval,  initially  containing  all  values  greater 
than  or  equal  to  0  and  less  than  1.  This  interval  is  denoted  by  [0,1),  where 
[a,b)  means  that  the  interval  includes  a,  but  not  b.  Let  C  denote  a  code  point, 
i.e.,  a  number  in  the  interval  [0,1).  The  value  of  C  is  what  is  transmitted,  i.e.,  a 
long  binary  fraction. 

Interval  A  is  divided  into  two  sub-intervals,  one  representing  the  LPS 
probability,  and  the  other  the  MPS  probability.  Let  the  LPS  and  MPS  sub¬ 
intervals  be  L  and  M  respectively.  Without  loss  of  generality,  assume  that  L  lies 
.^elow  M.  A  description  similar  to  what  follows  would  apply  tc  the  opposite  case, 
in  which  the  lower  sub-interval  represents  the  MPS,  and  the  upper  the  LPS 
probabil'ties. 

Assume,  for  the  present  discussion,  a  fictitious  coder  and  decoder  that  can 
perform  arithmetic  with  infinite  precision,  that  is,  express  any  value  in  [0,1) 
exactly. 

In  the  encoder,  A  is  initialized  to  [0,1),  C  to  0  and  P  and  Q  to  0.5.  (Other 
values  can  be  assigned  to  P  and  Q  if  there  is  a  priori  knowledge.)  The  initial 
value  of  the  MPS  (1  or  0)  is  assigned  arbitrarily  (unless  there  is  a  priori 
knowledge),  since  the  two  symbols  are  assumed,  at  the  outset,  to  be  equally 
probable. 
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When  the  encoder  receives  a  binary  symbol  to  be  encoded,  it  does  the 
following,  based  on  v/hether  the  symbol  is  the  MPS  or  the  LPS: 

MPS: 

Adds  the  LPS  sub-interval  to  C, 

Sets  A  =  A  *  P  (the  asterisk  denotes  multiplication). 

LPS; 

Leaves  C  alone. 

Sets  A  =  A  *  O. 

Either  case; 

Sets  LPS  sub-interval  =  0  *  A, 

Sets  MPS  sub-interval  =  A  -  LPS  sub-interval. 

During  the  encoding/decoding  process,  the  probability  estimates  are  frequently 
updated.  If  what  was  the  LPS  becomes  more  probable  than  the  MPS,  the  value 
of  the  MPS  is  reversed.  Thus  P  and  Q  always  denote  the  probabilities  of  the 
MPS  and  the  LPS  respectively,  with  O  less  than  or  equal  to  P. 

The  decoder  is  similarly  initialized,  except  that  C  contains  the  final  value  left 
by  the  encoder.  The  required  precision  of  C  will  be  presented  momentarily. 

The  decoder  decodes  a  binary  value  as  follows; 

If  C  is  greater  than  or  equal  to  the  LPS  sub-interval,  then  .t: 

Decodes  the  MPf  value, 

Subtracts  the  LPS  sub-interval  from  C, 

Sets  A  =  A  *  P. 
else  it; 

Decodes  the  LPS  value, 

Leaves  C  alone, 

Sets  A  =  A  *  Q, 
either  case: 

Sets  LPS  sub-interval  =  Q  *  A, 

Sets  MPS  sub-interval  =  A  -  LPS  sub-interval. 

A  study  of  the  encoding  process  reveals  that,  at  any  given  time,  A 
represents  the  probability  that  a  particular  string  of  binary  decisions  was 
encoded  prior  to  that  time,  M  is  the  probability  that  this  string  was  encoded  and 
that  the  next  symbol  will  be  the  MPS,  and  L  is  the  probability  that  this  string 
occurred  and  that  the  LPS  will  be  encoded  next.  As  each  symbol  is  encoded,  the 
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new  A  is  in  effect  laid  beside  the  old  sub-interval  associated  with  that  symbol. 

As  an  MPS  is  encoded,  the  new  A  is  laid  beside  the  M  sub-interval  of  the  old  A; 
as  an  LPS  is  encoded,  the  new  A  is  laid  beside  the  L  sub-interval.  C  points  to 
the  bottom  of  the  new  A  interval.  Furthermore,  at  any  given  time,  the  interval 
[C,C+A;  is  .ncluded  in  the  sub-intervals  of  ail  the  symbols  encoded  prior  to  that 
time.  Therefore,  the  decoding  process  can  begin  with  any  value  C’  in  the  final 
inter /al  [C,C+A)  left  by  the  encoder  without  affecting  the  ability  of  the  decoder 
to  reconstruct  the  original  string  of  binary  decisions.  The  minimum  number  of 
b’ts  '•equired  In  the  compressed  data  stream  is  therefore  the  minimum  number 
required  to  express  a  value  of  C’  anywhere  in  the  final  interval  [C,C+A). 

Figure  A.i  shows  graphically,  approximately  to  scale,  the  encoding  of  4 
symibols:  MPS,  MPS,  LPS,  MPS  (e.g.  1,  1,  0,  1  if  the  value  of  the  MPS  is  1).  The 
solid  rectangles  represent  the  A  intervals,  each  of  which  is  divided  into  the  M 
and  L  sub-intervals.  The  MPS  and  LPS  symbols  are  assumed  to  have  fixed 
probability  estimates  of  3/4  and  1/4  respectively;  hence  L  and  M  are  assumed  to 
be  initialized  to  [0,1/4)  and  [1/4,1)  instead  of  [0,1/2)  and  [1/2,1).  (In  an  actual 
coder,  the  probability  estimates  usually  start  at  1/2  and  are  frequently 
updated.)  At  the  end  of  the  encoding  sequence,  C  =  121/256  and  A  =  27/255. 
Therefore,  C  +  A  =  37/64;  whence  C’  can  be  anywhere  in  [121/256,37/64).  The 
value  1/2  lies  in  this  interval,  and  is  represented  by  a  single  bit,  1,  with  a 
leadin'’  ^  ^ary  point  implied.  Thus,  only  one  bit  is  required  in  the  compressed 
data  stream  for  this  simple  example.  Note  tnat  the  horizontal  line  representing 
1/2  passes  through  sub-intervals  M,  M,  L  and  M  of  the  four  A  intervals, 
including  the  initial  A  interval  [0,1).  Thus,  C’  =  1/2  has  sufficient  precision  to 
allow  correct  decoding  of  the  four  symbols. 

Figure  A.2  shows  the  decoding  of  the  four  symbols  with  the  initial  value  o‘  C 
set  equal  to  1/2.  Note  that  C  is  always  in  the  sub-interval  representing  the 
decoded  symbol. 

In  the  fictitious  encoder  and  decoder  assumed  above,  the  values  of  A  and  its 
sub-intervals  become  smaller  and  smaller  as  the  encoding/decoding  process 
proceeds.  Therefore,  as  the  number  of  encoded  symbols  becomes  large,  the 
precision  required  to  express  these  values  quickly  exceeds  that  of  any  practical 
digital  processor.  Moreover,  the  binary  fraction,  C,  soon  becomes  too  long  to  be 
contained  in  any  register,  or  even  any  buffer  of  practical  size.  Leading  bits  of 
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C  are  therefore,  in  practice,  output  to  the  communications  channel.  This  leads 
to  the  well-known  carry  problem:  As  an  MPS  is  encoded  and  the  LPS  sub¬ 
interval  is  added  to  C,  a  carry  might  propagate  through  a  long  string  of  1’s  nto 
a  part  of  C  already  transmitted. 

Any  practical  encoder  and  decoder  typically  copes  with  these  problems  m 
the  following  manner: 

o  The  processors  employ  fixed-precision  integer  arithmetic,  with  implied 
leading  binary  point.  The  interval  A  and  the  value  of  C  are  frequently 
scaled  up,  e.g.  doubled,  so  that  precision  is  maintained.  This  scaling  is 
usually  called  renorm.alization. 

o  As  C  is  scaled  up,  leading  bits  are  periodically  sent  to  the  compressed 
data  stream. 

o  Precautions  are  taken  to  cope  with  the  carry  problem.  A  common  method 
is  called  bit  stuffing.  When  the  encoder  generates  a  predetermined  fairly 
small  number  of  successive  1’s  (e.g.  8),  it  inserts  a  "stuff  bit”  of  value  0. 
Any  carry  generated  later  is  trapped  by  this  0  if  not  to  its  right.  The 
decoder,  upon  receiving  the  predetermined  number  of  I’s,  arithmetically 
adds  the  stuff  bit  (which  can  be  1  or  0,  depending  on  whether  a  carry 
propagated  into  it)  to  the  value  of  C  already  received  to  correct  its 
vaiue.' 

o  The  encoder  and  decoder  contain  identical  probability  estimators,  “he 
estimation  methods  and  frequency  with  which  the  estimates  are  updated 
are  based  on  compromises  among  data  compression  and  process:ng  speed 
and  complexity. 

o  The  processing  described  above  employs  .multiplication,  a  costly  operation 
whether  the  implementation  is  in  hardware  or  software.  Practical  systems 
use  approximations  to  eliminate,  or  at  least  minimize,  the  multiplication 
burden,  at  the  expense  of  less  compression.  Accurate  decoding  is  assured 
provided  that  the  two  sub-intervals  of  A  remain  contiguous,  greater  than 
zero,  and  do  not  overlap,  even  if  the  values  of  the  sub-intervals  do  not 
accurately  represent  the  MPS  and  LPS  probabilities.  That  is  why,  in  the 


A  means  of  producing  "pure"  output,  without  bit  stuffi.'ig, 
this  report. 


is  summarized  in  the  body  of 
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Figure  A.1  Graphical  Representation  of  Smple  Binary  Arithmetic  Encoding  Example 


Decoding  Status: 


initial 


After 

MPS 


After 

MPS 


After 

LPS 


After 

MPS 


MPS  probability  =  3/4 
LPS  probability  =  1/4 


Figure  A.2  Graphical  Representation  of  Smple  Binary  Arithmetic  Decoding  Example 
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basic  steps  shown  above,  sub-interval  M  is  set  equal  to  A  -  L,  thus  ensuring 
that  M  +  L  =  A,  even  if  M  and  L  are  only  approximately  the  optimal  values  for 
best  compression. 
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September  1,  1988 


Ms.  Jane  Doe 

999  Parkside  Avenue 

Buffalo,  NY  14214 

Dear  Ms.  Doe: 

This  is  to  confirm  our  meeting  of  September  9,  1988.  I  am 

enclosing  an  Itinerary  and  directions  for  reaching  the  Elmgrove 
Plant. 


We  look  forward  to  meeting  you  next  week.  If  you  have  any 
questions,  please  feel  to  contact  me  at  (555)  526-8769. 

Sincerely, 


Frank  Weiner 
Supervisor 

Image  Electronics  Center 
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ig^n.  hs  Konntt’  aucn 
^^den  und  nicht  mehr  /uru| 
lllen  wurdeschon  stimmen. 
jiberlicB  den  Cast  der  NichtJ 
ghm  den  bcstellten  Nicrsteinij 
Bin  selbcr  an  den  Fensterti 
ijso  erfrischendcs  Liiftchen 
J  jiinnen  Strohglascr  danebet 
SgenheileniJenutzt  wurden. 
i  anstoBen?"  fragle  Dr.  Taut 
J’m-schmeckt^  mir-nicht- 
ft  etwas  gefalirlich,  wCTn, 

_  muB. 

fnich  zu  zieren,  ein  z\veites 
_  J:e  yoll  und  stieB  mit  ihm  an| 
|§iut,  daB  ich  Ihren  N'amen  in  i 
"Ich  hatle  stmsf  gaF  nicl 

immer  praktisch",  er^ndcr 
nicht  meine  Sachc.  Aber| 
Pnan  leicht  sein  Eigentum  vei 
Jie.  Sie  kbnnte  der  Lust,  ihj 
j|  er  niKh  nie  hier  gevvesen 
Jvte  ihn  entrustet  an,  als  ei 
J.K'h  M''  sagteMeargerlicli 
S.T  meine  Mappe  hierher’" 
p  Sie  strafen  sich  selber  Liii 

sie  "Darauf  trinke  ich 
ftlhnen  an,  uenn  Sie  gestehej 

|te",  sagte  cr,  sein  Weinglas 
famlich  uirklich  nicht,  xyie 
I  Ich  .kann  mir  nur  denkenJ 


Berlinfh-iert  in  beiden  I 
dung.  Kn  S'Chiineberger  j 
heitsgiockelauten.  wvnf 
beth  H.  vonCretJbritaml 
Reagan  und  der  Iran/rt 
n.uh  Berlin  kommen.  B 
scheidplat/  an  der  Bed 
danun. 

"Er  geht  j.i  in  der  U“ 

sanft  und  wandte  sich  urn,  denn  die  lur  zum  LOKai  w-uroe  meiner  uruuer  eiuiii.ii  me.  .... 
geiittnet,  und  der  erste  Cast  dieso  TagiS  trat  ein.  Sie  war  iiKT  -den  Durst  getninken  hat  und  dann  in 
einen  Augenblick  s<»  \  erbium,  daB  sie  nicht  einm^I  guten  sehgkeit  vergaB,  die  Mappe  an  sich  zu  nehmen 
Tag  sageii  konnte.  Datur  sagte  Tauborl  es,  und  sie  erkannte  Verdienst  bin  ich  nun  der  N'utznieber  dieser  Verj 
Mifurt  dip  Slininie,  die  sie  am  Teletdn  gehdrt  hatte.  Eines 
war  allerdings  kinnisclv  Sie  hatte  ein  gutes  Cedachtnis  fur 
C.esichler,  aber  seins  war  n.K'h  nii’  hier  im  SunneKichhof- 

^■fet^te  sie  •  anderswo. 
i  v  gcwcs-Mi  scin 


Stl 


anebaeh  und  waren 
Ooktur!" 

ich,  die  sich  in  der 
wei!  namiich  die- 
glaubte  K*merken 
en  sehr  wohl  wisse, 
ivein  /u  kre*den/en 
eiBwein  von  ObeT, 

^  er  mit  einein  ver- 
r  i-t  e^,  iivnii  man 

r  Me  ilin  kannle.  1  r 
ni.mn  in  der  Uein- 
”  |.•^ltl'■chc  siure  Leber 
'iht  frmnern  l:>gab 
{-••  gab  aiuh  kemen 
jien. /iim,erMen  M.il 
lc!»)>jn.ynhing,  muBle 
\ulV  j.deiiVi-.ill  stand  mmii 
le  iljni  oder  hatte  il.im 
ihfli  '>elb»T  verM'lieiikt 


Sie  soli  geMegnet  sein!  I’rost,  Fraulein  Jossie! 

I’riKit’  sagte  sie  belustigt,  slieB  mit  ihm  an 
dann  die  Mappe  aus  dem  F.inKiuschrank  henii 
Sie.  hier  steht  Ihr  Name  eingezeichnet  -  cand.meil, 

Ss’lige  Zeilen!"  lachle  TaulxTt.  "Das  l.exler  h 
ausgehallen."  •  .  ^ 

J.'sxie  nickle,  spielte  mit  dem  VerschhiB  der 
da-  etwa-  verrostete  SchloB  aiif-  und  /u-pringe 
dann  von  neuem  an. 

Tauberl  hahm  sie  ihr  behutsim  aus  den  1  lande 
sie  neben  sich  auf  das  pensterbrett.  "SiP  werdei 
denken  ktinnen.  wie-i*  ich  am  Telefon  ge-.igt  h 
Niersteiner  ofine  Sprudel  m.Khlc  -  Ich  haK* 
einer  VVeinslube  in  Munchen  gOMMsc'ii,  al-  Sie  m 
teren  Herm  hereinkamen.  L'nd  der  bestellte  so:  ' 
nut  und  Sie  haben  ihn  de-wi-gen  getadelt  und  Tirr: 
"Ich  crinnere  niieii",  s.igte  Jossie. 

■  "F rinnern  Sie  sich  auch  meiner’ ' 

.Yuch  Hirer,  ja'  -  Sie  >,iBeii  an  eineiii  der  Nely 
aBeii  gebackeiie  I  eber 

Liid  bi-  ich  auf-  und  micli  umsali,  waren  Sic 
raiilierf .  "AlH-r  der  I  linimel  hat  nur  die-eii  rei/e 
nut  der  Mappe  gescluckt,  iiiuj  aiit  den  frinkeii  wi 
( lott,  ob  leh  Sie  -onst  so  schiielf  gefuTOyn  hatte 
lossie  wollte  tragen  habeii  Sie  mich^enu  gesiu 
lieB  es  am  bi-sten  sein.  I'r  war  nichijder  er-lt' 
bistimnit  auch .  nicht  tier  let/te  seiff  Ivi  den 
i-lainmcheii  der  ZuiTeigiing  aiUglimiuen  -ah  I-.s  b 
cbeiiso  scliirelt  wieder  liPriiiiter  und  erlo'cli,  ehe  cj 
gelu’ii  verniochte.  Das  war  weiler  olua'  Nc'ue-,  I 
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